arXiv: 1502.04607v2 [math.CA] 27 Apr 2015 


Some aspects of analysis related to 
p-adic numbers, 2


Stephen Semmes 
Rice University 


Preface 


Some aspects of analysis involving fields with absolute value functions are
discussed, including the real or complex numbers with their standard absolute
values, as well as ultrametric situations like the p-adic numbers.





Contents

1 Basic notions
    1.1 Metrics and ultrametrics
    1.2 Quasimetrics
    1.3 Absolute value functions
    1.4 Completeness
    1.5 Quasimetric absolute value functions
    1.6 The archimedian property
    1.7 Topological equivalence
    1.8 Ostrowski’s theorems
    1.9 Discrete absolute value functions
    1.10 Nonnegative sums
    1.11 Nonnegative sums, continued

2 Norms on vector spaces
    2.1 Norms and ultranorms
    2.2 The supremum norm
    2.3 $\ell^r$ Norms
    2.4 Bounded linear mappings
    2.5 Infinite series
    2.6 Generalized convergence
    2.7 Generalized convergence, continued
    2.8 Bounded finite sums
    2.9 Sums of sums
    2.10 Finite-dimensional vector spaces
    2.11 $q$-Norms
    2.12 $\ell^r$ Norms, continued

3 Additional examples and results
    3.1 Cauchy products
    3.2 Formal power series
    3.3 Geometric series
    3.4 Formal Laurent series
    3.5 p-Adic integers
    3.6 Radius of convergence
    3.7 Compositions
    3.8 Compositions, continued
    3.9 Changing centers
    3.10 The residue field

4 Geometry of mappings
    4.1 Differentiation
    4.2 Mappings between metric spaces
    4.3 $k$-Valued functions
    4.4 Lipschitz mappings
    4.5 Lipschitz mappings, continued
    4.6 Differentiation, continued
    4.7 Derivative 0
    4.8 Some related estimates
    4.9 Some related estimates, continued
    4.10 Hensel’s lemma
    4.11 Some variants
    4.12 Some variants, continued
    4.13 A basic situation
    4.14 Some examples
    4.15 Some examples, continued

5 Some additional topics
    5.1 Sums of functions
    5.2 Lipschitz seminorms
    5.3 The product rule
    5.4 The chain rule
    5.5 Functions of sums
    5.6 The logarithm
    5.7 The usual identity
    5.8 Some additional properties
    5.9 Some additional properties, continued
    5.10 Outer measures
    5.11 Hausdorff measures
    5.12 Hausdorff measures, continued
    5.13 Lipschitz mappings, revisited
    5.14 Local Lipschitz conditions

Bibliography




































Chapter 1 


Basic notions 


1.1 Metrics and ultrametrics 

Remember that a metric on a set $M$ is a nonnegative real-valued function
$d(x, y)$ defined for $x, y \in M$ that satisfies the following three
conditions. First,

(1.1) $d(x, y) = 0$ if and only if $x = y$.

Second, $d(x, y)$ should be symmetric in $x$ and $y$, so that

(1.2) $d(x, y) = d(y, x)$

for every $x, y \in M$. Third,

(1.3) $d(x, z) \le d(x, y) + d(y, z)$

for every $x, y, z \in M$, which is to say that $d(\cdot, \cdot)$ satisfies the
triangle inequality on $M$. If

(1.4) $d(x, z) \le \max(d(x, y), d(y, z))$

for every $x, y, z \in M$, then $d(\cdot, \cdot)$ is said to be an ultrametric
on $M$. Of course, the ultrametric version of the triangle inequality (1.4)
automatically implies the ordinary triangle inequality (1.3). The discrete
metric on $M$ is defined by putting $d(x, y)$ equal to 1 when $x \ne y$ and to 0
otherwise, and it is easy to see that this is an ultrametric on $M$.

Let $(M, d(x, y))$ be a metric space, so that $d(x, y)$ is a metric on a set
$M$. The open and closed balls centered at a point $x \in M$ with radius
$r > 0$ are defined as usual by

(1.5) $B(x, r) = \{z \in M : d(x, z) < r\}$

and

(1.6) $\overline{B}(x, r) = \{z \in M : d(x, z) \le r\}$,

respectively. It is sometimes convenient to allow $r = 0$ in (1.6), so that
(1.6) reduces to $\{x\}$. If $y$ is any element of $B(x, r)$, then
$t = r - d(x, y) > 0$, and one can check that

(1.7) $B(y, t) \subseteq B(x, r)$,

using the triangle inequality. Similarly, if $y \in \overline{B}(x, r)$, then
$t = r - d(x, y) \ge 0$, and

(1.8) $\overline{B}(y, t) \subseteq \overline{B}(x, r)$,

by the triangle inequality. One can also define collections of open and closed
subsets of $M$ in the standard way, to get a topology on $M$ determined by the
metric. Open balls are open sets with respect to this topology, by (1.7), and it
is well known that closed balls are closed sets too.

If $d(\cdot, \cdot)$ is an ultrametric on $M$, then it is easy to see that
(1.7) holds with $t = r$. More precisely,

(1.9) $B(x, r) = B(y, r)$

for every $x, y \in M$ with $d(x, y) < r$, because each of the two balls is
contained in the other, by the same argument. Similarly, (1.8) holds with
$t = r$ when $d(\cdot, \cdot)$ is an ultrametric on $M$, and in fact

(1.10) $\overline{B}(x, r) = \overline{B}(y, r)$

for every $x, y \in M$ with $d(x, y) \le r$. It follows that closed balls of
positive radius are also open subsets of $M$ in this case. One can check that
open balls in $M$ are closed sets too when $d(\cdot, \cdot)$ is an ultrametric,
which is equivalent to saying that the complement of an open ball is an open set
in this situation.

Let us continue to suppose that $d(\cdot, \cdot)$ is an ultrametric on $M$, and
let $x$, $y$, $z$ be elements of $M$. If $d(x, y) \le d(y, z)$, then (1.4)
implies that

(1.11) $d(x, z) \le d(y, z)$.

If $d(x, y) < d(y, z)$, then

(1.12) $d(y, z) \le \max(d(y, x), d(x, z))$

implies that

(1.13) $d(y, z) \le d(x, z)$,

because the maximum in (1.12) cannot be attained by $d(y, x) < d(y, z)$. It
follows that

(1.14) $d(x, z) = d(y, z)$

when $d(x, y) < d(y, z)$, by combining (1.11) and (1.13).
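As a quick sanity check of (1.4) and (1.14), the following sketch brute-forces
both conditions over a small finite set of integers, for the discrete metric and
for the standard metric. The helper names (`discrete`, `euclid`,
`is_ultrametric`, `isosceles_ok`) are illustrative choices made here and do not
come from the text.

```python
from itertools import product

def discrete(x, y):
    """Discrete metric: 1 when x != y, and 0 otherwise."""
    return 0 if x == y else 1

def euclid(x, y):
    """Standard metric on the integers, used for comparison."""
    return abs(x - y)

def is_ultrametric(d, points):
    """Check (1.4): d(x, z) <= max(d(x, y), d(y, z)) for all triples."""
    return all(d(x, z) <= max(d(x, y), d(y, z))
               for x, y, z in product(points, repeat=3))

def isosceles_ok(d, points):
    """Check (1.14): d(x, y) < d(y, z) forces d(x, z) == d(y, z)."""
    return all(d(x, z) == d(y, z)
               for x, y, z in product(points, repeat=3)
               if d(x, y) < d(y, z))

points = range(5)
print(is_ultrametric(discrete, points))  # True
print(is_ultrametric(euclid, points))    # False: x=0, y=1, z=2 gives 2 > 1
print(isosceles_ok(discrete, points))    # True
print(isosceles_ok(euclid, points))      # False: x=0, y=1, z=3 gives 3 != 2
```

A finite check like this is of course not a proof; it only illustrates how the
two inequalities behave on concrete triples.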

1.2 Quasimetrics 

Let $M$ be a set, and let $d(x, y)$ be a nonnegative real-valued function
defined for $x, y \in M$ that satisfies the first two requirements (1.1) and
(1.2) of a metric in Section 1.1. If there is a real number $C \ge 1$ such that

(1.15) $d(x, z) \le C \, (d(x, y) + d(y, z))$

for every $x, y, z \in M$, then $d(\cdot, \cdot)$ is said to be a quasimetric on
$M$. Equivalently, $d(\cdot, \cdot)$ is a quasimetric on $M$ if there is a real
number $C' \ge 1$ such that

(1.16) $d(x, z) \le C' \max(d(x, y), d(y, z))$

for every $x, y, z \in M$. More precisely, (1.15) implies (1.16) with $C'$ taken
to be $2C$, and (1.16) implies (1.15) with $C$ taken to be $C'$. Of course,
(1.15) is the same as (1.3) when $C = 1$, and (1.16) is the same as (1.4) when
$C' = 1$. If $d(\cdot, \cdot)$ is a quasimetric on $M$, then one can define open
and closed balls in $M$ with respect to $d(\cdot, \cdot)$ in the same way as
before, as in (1.5) and (1.6). One can also use open balls in $M$ to define a
collection of open subsets of $M$ in the standard way, which leads to a topology
on $M$. However, open balls in $M$ are not necessarily open sets in this
situation, and one should be a bit careful about some other differences as well.
We shall not be too concerned with quasimetrics here, but the terminology will
sometimes be convenient.

Suppose that $d(x, y)$ is a quasimetric on a set $M$ that satisfies (1.16) for
some $C' \ge 1$, and that $a$ is a positive real number. It is easy to see that
$d(x, y)^a$ is also a quasimetric on $M$ under these conditions, because

(1.17) $d(x, z)^a \le (C')^a \max(d(x, y)^a, d(y, z)^a)$

for every $x, y, z \in M$. In particular, if $d(x, y)$ is an ultrametric on $M$,
then $d(x, y)^a$ is an ultrametric on $M$ for every $a > 0$, since one can take
$C' = 1$ in (1.17). If $d(x, y)$ is any quasimetric on $M$ and $a > 0$, then the
open ball centered at a point in $M$ with radius $r > 0$ with respect to
$d(x, y)$ is the same as the open ball centered at the same point in $M$ with
radius $r^a$ with respect to $d(x, y)^a$. This implies that the topology on $M$
associated to $d(x, y)^a$ is the same as the topology associated to $d(x, y)$
for every $a > 0$.

If $0 < a \le 1$, then it is well known that

(1.18) $(r + t)^a \le r^a + t^a$

for every $r, t \ge 0$, which is the same as saying that

(1.19) $r + t \le (r^a + t^a)^{1/a}$.

Indeed,

(1.20) $\max(r, t) \le (r^a + t^a)^{1/a}$

for every $r, t \ge 0$, which implies that

(1.21) $r + t \le \max(r, t)^{1-a} (r^a + t^a) \le (r^a + t^a)^{((1-a)/a)+1} = (r^a + t^a)^{1/a}$,

as desired. If $d(x, y)$ is a quasimetric on a set $M$ that satisfies (1.15) for
some $C \ge 1$, then it follows that

(1.22) $d(x, z)^a \le C^a (d(x, y)^a + d(y, z)^a)$

for every $x, y, z \in M$ when $0 < a \le 1$. In particular, if $d(x, y)$ is a
metric on $M$, then $d(x, y)^a$ is also a metric on $M$ when $0 < a \le 1$,
since we can take $C = 1$ in (1.22).





If $a \ge 1$, then $f(r) = r^a$ is a convex function on $[0, \infty)$, which
implies that

(1.23) $(r + t)^a = 2^a (r/2 + t/2)^a \le 2^a (r^a/2 + t^a/2) = 2^{a-1} (r^a + t^a)$

for every $r, t \ge 0$. If $d(x, y)$ is again a quasimetric on a set $M$ that
satisfies (1.15) for some $C \ge 1$, then we get that

(1.24) $d(x, z)^a \le 2^{a-1} C^a (d(x, y)^a + d(y, z)^a)$

for every $x, y, z \in M$ when $a \ge 1$. This gives another way to see that
$d(x, y)^a$ is a quasimetric on $M$ for every $a > 0$ when $d(x, y)$ is a
quasimetric on $M$, using (1.22) and (1.24) instead of (1.17). If $M$ is the
real line $\mathbf{R}$ and $d(x, y)$ is the standard metric on $\mathbf{R}$,
then it is easy to see that $d(x, y)^a$ is not a metric on $\mathbf{R}$ for any
$a > 1$.
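The last remark can be illustrated numerically. The sketch below takes
$d(x, y) = |x - y|$ on a few integers and tests, for integer exponents $a$,
both the ordinary triangle inequality for $d(x, y)^a$ and the weaker estimate
(1.24) with $C = 1$. The function names and the restriction to integer points
and exponents (which keeps the arithmetic exact) are assumptions of this sketch.

```python
from itertools import product

def triangle_holds(a, points):
    """Check d(x, z)^a <= d(x, y)^a + d(y, z)^a for d(x, y) = |x - y|."""
    return all(abs(x - z) ** a <= abs(x - y) ** a + abs(y - z) ** a
               for x, y, z in product(points, repeat=3))

def quasi_holds(a, points):
    """Check (1.24) with C = 1: d(x, z)^a <= 2^(a-1) (d(x, y)^a + d(y, z)^a)."""
    return all(abs(x - z) ** a
               <= 2 ** (a - 1) * (abs(x - y) ** a + abs(y - z) ** a)
               for x, y, z in product(points, repeat=3))

points = range(-3, 4)
print(triangle_holds(1, points))  # True: the standard metric itself
print(triangle_holds(2, points))  # False: x=0, y=1, z=2 gives 4 > 1 + 1
print(quasi_holds(2, points))     # True: the constant 2^(a-1) = 2 suffices
```

The failing triple for $a = 2$ is the midpoint configuration, which is also the
case of equality in (1.23).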


1.3 Absolute value functions 

Let $k$ be a field. A nonnegative real-valued function $|x|$ defined for
$x \in k$ is said to be an absolute value function on $k$ if it satisfies the
following three conditions. First,

(1.25) $|x| = 0$ if and only if $x = 0$.

Second, $|\cdot|$ should be multiplicative on $k$, which is to say that

(1.26) $|x y| = |x| \, |y|$

for every $x, y \in k$. Third, $|\cdot|$ should satisfy the triangle inequality
on $k$, in the sense that

(1.27) $|x + y| \le |x| + |y|$

for every $x, y \in k$.

Of course, the standard absolute value of a real number $x$ is defined by
putting $|x| = x$ when $x \ge 0$ and $|x| = -x$ when $x \le 0$. This satisfies
the three conditions mentioned in the preceding paragraph, and hence defines an
absolute value function on the field $\mathbf{R}$ of real numbers. Similarly,
the usual absolute value or modulus of a complex number defines an absolute
value function on the field $\mathbf{C}$ of complex numbers. If $k$ is any
field, then the trivial absolute value function is defined on $k$ by putting
$|0| = 0$ and

(1.28) $|x| = 1$ for every $x \in k$ with $x \ne 0$.

It is easy to see that this satisfies the three conditions in the previous
paragraph as well.

Let $|\cdot|$ be an absolute value function on a field $k$. As usual, we let 0
and 1 denote the additive and multiplicative identity elements of $k$,
respectively, as well as their counterparts in $\mathbf{R}$, and it should
always be clear from the context which is intended. Note that $|1| > 0$, since
$1 \ne 0$ in $k$, by definition of a field. Because $1^2 = 1$ in $k$, we get
that $|1|^2 = |1^2| = |1|$, by (1.26), and hence

(1.29) $|1| = 1$.

If $x \in k$ satisfies $x^n = 1$ for some positive integer $n$, then it follows
that

(1.30) $|x|^n = |x^n| = |1| = 1$,

so that $|x| = 1$.

If $x \in k$, then the additive inverse of $x$ in $k$ is denoted $-x$, as
usual. In particular, $-1$ denotes the additive inverse of 1 in $k$, and it is
easy to see that

(1.31) $(-1) \, x = -x$

for every $x \in k$. Applying this to $x = -1$, we get that $(-1)^2 = 1$, and
hence

(1.32) $|-1| = 1$,

by (1.30). It follows that

(1.33) $|-x| = |x|$

for every $x \in k$, by (1.26), (1.31), and (1.32). If $x \in k$ and $x \ne 0$,
then $x$ has a multiplicative inverse $x^{-1}$ in $k$, and

(1.34) $|x^{-1}| = |x|^{-1}$,

by (1.26) and (1.29).

If we put

(1.35) $d(x, y) = |x - y|$

for each $x, y \in k$, then $d(x, y)$ is symmetric in $x$ and $y$, by (1.33).
Thus (1.35) defines a metric on $k$, since (1.1) and (1.3) in Section 1.1 follow
from (1.25) and (1.27). Let us say that $|\cdot|$ is an ultrametric absolute
value function on $k$ if it satisfies

(1.36) $|x + y| \le \max(|x|, |y|)$

for every $x, y \in k$, which implies (1.27). In this case, (1.35) is an
ultrametric on $k$, because (1.4) in Section 1.1 follows from (1.36). The
trivial absolute value function on $k$ is an ultrametric absolute value
function, for which the corresponding ultrametric (1.35) is the discrete metric.

As another class of examples, let $p$ be a prime number, and let us recall the
definition of the p-adic absolute value $|x|_p$ of a rational number $x$. Of
course, $|0|_p = 0$. Otherwise, if $x \ne 0$, then $x$ can be expressed as
$p^j (a/b)$, where $a$, $b$, and $j$ are integers, $a, b \ne 0$, and neither
$a$ nor $b$ is divisible by $p$. In this case, we put

(1.37) $|x|_p = p^{-j}$,

and one can check that this defines an ultrametric absolute value function on
the field $\mathbf{Q}$ of rational numbers. The corresponding ultrametric

(1.38) $d_p(x, y) = |x - y|_p$

is known as the p-adic metric on $\mathbf{Q}$.
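The definition (1.37) translates directly into a short computation. The sketch
below uses exact rational arithmetic to evaluate $|x|_p$ and then spot-checks
multiplicativity (1.26) and the ultrametric inequality (1.36) on a finite sample
of rationals. The names `padic_abs` and `check_axioms`, and the particular
sample, are choices made for this illustration only.

```python
from fractions import Fraction
from itertools import product

def padic_abs(x, p):
    """p-adic absolute value of a rational x, as in (1.37): |x|_p = p^(-j)."""
    x = Fraction(x)
    if x == 0:
        return Fraction(0)
    j = 0
    num, den = x.numerator, x.denominator
    while num % p == 0:      # factors of p in the numerator raise j
        num //= p
        j += 1
    while den % p == 0:      # factors of p in the denominator lower j
        den //= p
        j -= 1
    return Fraction(p) ** (-j)

def check_axioms(p, samples):
    """Check (1.26) and (1.36) for |.|_p on every pair drawn from samples."""
    for x, y in product(samples, repeat=2):
        ax, ay = padic_abs(x, p), padic_abs(y, p)
        if padic_abs(x * y, p) != ax * ay:
            return False
        if padic_abs(x + y, p) > max(ax, ay):
            return False
    return True

samples = [Fraction(n, m) for n in range(-6, 7) for m in range(1, 7)]
print(padic_abs(Fraction(18), 3))     # 1/9, since 18 = 3^2 * 2
print(padic_abs(Fraction(5, 12), 2))  # 4, since 5/12 = 2^(-2) * (5/3)
print(check_axioms(3, samples))       # True
```

Working with `Fraction` rather than floating-point numbers keeps every
comparison exact, which matters because (1.36) is often an equality.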





If $|\cdot|$ is any absolute value function on a field $k$, then the
corresponding metric (1.35) determines a topology on $k$ in the usual way. It is
easy to see that addition and multiplication define continuous mappings from
$k \times k$ into $k$ under these conditions, using the product topology on
$k \times k$ associated to this topology. Similarly, one can check that

(1.39) $x \mapsto x^{-1}$

defines a continuous mapping from $k \setminus \{0\}$ into itself in this
situation. The proofs of these statements are analogous to standard arguments
for real and complex numbers.

Let $|\cdot|$ be an ultrametric absolute value function on a field $k$, and let
$x, y \in k$ be given. If $|x| \le |y|$, then

(1.40) $|x + y| \le \max(|x|, |y|) = |y|$.

If $|x| < |y|$, then

(1.41) $|y| = |-x + (x + y)| \le \max(|-x|, |x + y|) = \max(|x|, |x + y|)$

implies that

(1.42) $|y| \le |x + y|$.

It follows that

(1.43) $|x + y| = |y|$

when $|x| < |y|$, by combining (1.40) and (1.42). Of course, this can also be
considered as a special case of the remarks at the end of Section 1.1.
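A concrete instance of (1.43) may help. In the sketch below, `ord_p` is a
hypothetical helper returning the exponent $j$ from (1.37), so that
$|x|_p = p^{-j}$; neither the name nor the notation is taken from the text. The
example also shows that the strict hypothesis $|x| < |y|$ cannot be dropped.

```python
from fractions import Fraction

def ord_p(x, p):
    """Exponent j with x = p^j (a/b), p dividing neither a nor b; x nonzero."""
    x = Fraction(x)
    if x == 0:
        raise ValueError("ord_p is undefined at 0")
    j, num, den = 0, x.numerator, x.denominator
    while num % p == 0:
        num, j = num // p, j + 1
    while den % p == 0:
        den, j = den // p, j - 1
    return j

def padic_abs(x, p):
    """|x|_p = p^(-ord_p(x)) for x != 0, and |0|_p = 0."""
    x = Fraction(x)
    return Fraction(0) if x == 0 else Fraction(p) ** (-ord_p(x, p))

p = 5
x, y = Fraction(25), Fraction(3)     # |x|_5 = 1/25 < 1 = |y|_5
print(padic_abs(x + y, p) == padic_abs(y, p))  # True, as in (1.43)

u, v = Fraction(1), Fraction(4)      # |u|_5 = |v|_5 = 1
print(padic_abs(u + v, p))           # 1/5, strictly smaller than max(|u|, |v|)
```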


1.4 Completeness 

Remember that a metric space $(M, d(x, y))$ is said to be complete if every
Cauchy sequence of elements of $M$ converges to an element of $M$. If $M$ is
not complete, then it is well known that $M$ can be completed, in the sense
that there is an isometric embedding of $M$ onto a dense subset of a complete
metric space. Such a completion of $M$ is also unique up to isometric
equivalence. If $d(x, y)$ is an ultrametric on $M$, then it is not difficult to
see that the completion of $M$ is an ultrametric space as well.

Let $k$ be a field, let $|\cdot|$ be an absolute value function on $k$, and
consider the corresponding metric (1.35). If $k$ is not already complete with
respect to this metric, then $k$ can be completed as a metric space, as in the
preceding paragraph. One can check that the absolute value function and field
operations on $k$ can also be extended to the completion in a natural way. More
precisely, the absolute value function on $k$ is the same as the distance to 0
with respect to the corresponding metric, and so its extension to the
completion of $k$ is already included in the metric on the completion of $k$.
Addition and multiplication on $k$ can be extended as mappings from
$k \times k$ into $k$ to analogous mappings for the completion of $k$, with the
appropriate continuity properties. Nonzero elements of the completion of $k$
have multiplicative inverses in the completion of $k$, so that the completion
of $k$ becomes a field. The extension of the absolute value function on $k$ to
the completion of $k$ is an absolute value function on the completion of $k$,
which corresponds to the metric on the completion of $k$ in the same way as
before. If $|\cdot|$ is an ultrametric absolute value function on $k$, then its
extension to the completion of $k$ is also an ultrametric absolute value
function.

Of course, the real and complex numbers are already complete with respect to
their standard Euclidean metrics. The set $\mathbf{Q}$ of rational numbers is
not complete with respect to the standard Euclidean metric, and its completion
as a metric space corresponds to the real line with the standard Euclidean
metric. More precisely, the completion of $\mathbf{Q}$ as a field with the
standard absolute value function corresponds to the field $\mathbf{R}$ of real
numbers with the standard absolute value function, as in the preceding
paragraph. One can also show that $\mathbf{Q}$ is not complete with respect to
the p-adic metric for any prime number $p$. The completion of $\mathbf{Q}$ with
respect to the p-adic metric leads to the field $\mathbf{Q}_p$ of p-adic
numbers. The extensions of the p-adic absolute value and metric to
$\mathbf{Q}_p$ are denoted $|\cdot|_p$ and $d_p(\cdot, \cdot)$, as before.
Because the p-adic absolute value is an ultrametric absolute value function on
$\mathbf{Q}$, its extension to $\mathbf{Q}_p$ is an ultrametric absolute value
function as well.

If $p$ is a prime number, $y \in \mathbf{Q}$, and $y \ne 0$, then $|y|_p$ is an
integer power of $p$, by construction. Similarly, if $y \in \mathbf{Q}_p$ and
$y \ne 0$, then $|y|_p$ is still an integer power of $p$. This follows from the
construction of $\mathbf{Q}_p$ as the completion of $\mathbf{Q}$ with respect
to the p-adic metric, and it can also be derived from the analogous statement
for $\mathbf{Q}$ and the fact that $\mathbf{Q}$ is dense in $\mathbf{Q}_p$,
using (1.43) in Section 1.3.


1.5 Quasimetric absolute value functions 

Let $k$ be a field again, and let $|\cdot|$ be a nonnegative real-valued
function on $k$ that satisfies (1.25) and (1.26) in Section 1.3. As before, this
implies that $|\cdot|$ satisfies (1.29), (1.30), (1.32), (1.33), and (1.34) in
Section 1.3. Let us say that $|\cdot|$ is a quasimetric absolute value function
on $k$ if there is a real number $C \ge 1$ such that

(1.44) $|x + y| \le C \, (|x| + |y|)$

for every $x, y \in k$. Equivalently, $|\cdot|$ is a quasimetric absolute value
function on $k$ if there is a $C' \ge 1$ such that

(1.45) $|x + y| \le C' \max(|x|, |y|)$

for every $x, y \in k$. As in Section 1.2, (1.44) implies (1.45) with $C'$ taken
to be $2C$, and (1.45) implies (1.44) with $C$ taken to be $C'$. Note that
(1.27) in Section 1.3 is the same as (1.44) with $C = 1$, and that (1.36) in
Section 1.3 is the same as (1.45) with $C' = 1$. If $|\cdot|$ is a quasimetric
absolute value function on $k$, then (1.35) is a quasimetric on $k$, where
(1.15) and (1.16) in Section 1.2 correspond exactly to (1.44) and (1.45),
respectively.





If $|x|$ is a quasimetric absolute value function on $k$, then $|x|^a$ is also
a quasimetric absolute value function on $k$ for every positive real number
$a$. More precisely, if $|x|$ satisfies (1.45) for some $C' \ge 1$, then

(1.46) $|x + y|^a \le (C')^a \max(|x|^a, |y|^a)$

for every $x, y \in k$. In particular, if $|x|$ is an ultrametric absolute value
function on $k$, then $|x|^a$ is an ultrametric absolute value function on $k$
for every $a > 0$, since one can take $C' = 1$ in (1.46). Similarly, if $|x|$
satisfies (1.44) for some $C \ge 1$, then

(1.47) $|x + y|^a \le C^a (|x|^a + |y|^a)$

for every $x, y \in k$ when $0 < a \le 1$, by (1.18) in Section 1.2. If
$a \ge 1$, then we get that

(1.48) $|x + y|^a \le 2^{a-1} C^a (|x|^a + |y|^a)$

for every $x, y \in k$, by (1.23) in Section 1.2. If $|x|$ is an absolute value
function on $k$, then it follows from (1.47) that $|x|^a$ is an absolute value
function on $k$ when $0 < a \le 1$, since one can take $C = 1$. If $|x|$ is the
standard absolute value function on $\mathbf{Q}$ or $\mathbf{R}$, then $|x|^a$
is not an absolute value function for any $a > 1$.

Let $|\cdot|$ be a nonnegative real-valued function on a field $k$ that
satisfies (1.25) and (1.26) in Section 1.3 again. If $|\cdot|$ satisfies (1.45)
for some $C' \ge 1$, then it follows that

(1.49) $|1 + z| \le C'$

for every $z \in k$ with $|z| \le 1$, by (1.29) in Section 1.3. Conversely, one
can check that (1.49) implies (1.45), using (1.26) in Section 1.3. More
precisely, (1.45) is trivial when $x = 0$ or $y = 0$, and so one may as well
suppose that $x, y \ne 0$. If $|x| \le |y|$, then one can apply (1.49) to
$z = x y^{-1}$ to get (1.45). Similarly, if $|y| \le |x|$, then (1.45) follows
from (1.49) applied to $z = x^{-1} y$. Thus $|\cdot|$ is a quasimetric absolute
value function on $k$ if and only if it satisfies (1.49) for some $C' \ge 1$.
This corresponds to Definition 1.1 on p12 of [2], with different terminology.

If $|\cdot|$ is an absolute value function on a field $k$, then $|\cdot|$
satisfies (1.45) with $C' = 2$. Conversely, if $|\cdot|$ is a quasimetric
absolute value function on $k$ that satisfies (1.45) with $C' = 2$, then it can
be shown that $|\cdot|$ is an absolute value function on $k$. See Lemma 1.2 on
p13 of [2]. If $|x|$ is a quasimetric absolute value function on $k$ that
satisfies (1.45) for some $C' \ge 1$, then it follows that $|x|^a$ is an
absolute value function on $k$ for every $a > 0$ such that $(C')^a \le 2$, by
(1.46). In particular, this condition holds for all sufficiently small $a > 0$.


1.6 The archimedian property 

Let $k$ be a field, and let $\mathbf{Z}_+$ denote the set of positive integers,
as usual. If $x \in k$ and $n$ is a positive integer, then we let $n \cdot x$
denote the sum of $n$ $x$'s in $k$. It is easy to see that

(1.50) $n_1 \cdot (n_2 \cdot x) = (n_1 n_2) \cdot x$

for every $n_1, n_2 \in \mathbf{Z}_+$ and $x \in k$, and that

(1.51) $(n \cdot x) \, y = n \cdot (x y)$

for every $n \in \mathbf{Z}_+$ and $x, y \in k$. In particular,

(1.52) $n \cdot x = (n \cdot 1) \, x$

for every $n \in \mathbf{Z}_+$ and $x \in k$, where 1 refers to the
multiplicative identity element in $k$. Similarly,

(1.53) $(n_1 \cdot x) \, (n_2 \cdot y) = (n_1 n_2) \cdot (x y)$

for every $n_1, n_2 \in \mathbf{Z}_+$ and $x, y \in k$, which can be verified
directly, or using (1.50) and (1.51).

Let $|\cdot|$ be an absolute value function on $k$. Observe that

(1.54) $|n \cdot 1| \le n$

for every $n \in \mathbf{Z}_+$, by (1.29) in Section 1.3. If $|\cdot|$ is an
ultrametric absolute value function on $k$, then

(1.55) $|n \cdot 1| \le 1$

for every $n \in \mathbf{Z}_+$. Of course,

(1.56) $|(n_1 n_2) \cdot 1| = |(n_1 \cdot 1) (n_2 \cdot 1)| = |n_1 \cdot 1| \, |n_2 \cdot 1|$

for every $n_1, n_2 \in \mathbf{Z}_+$, which implies that

(1.57) $|n^j \cdot 1| = |n \cdot 1|^j$

for every $j, n \in \mathbf{Z}_+$. If $|n \cdot 1| > 1$ for some
$n \in \mathbf{Z}_+$, then it follows that (1.57) tends to infinity as
$j \to \infty$.

An absolute value function $|\cdot|$ on a field $k$ is said to be archimedian
if the nonnegative real numbers of the form $|n \cdot 1|$ with
$n \in \mathbf{Z}_+$ do not have a finite upper bound. Otherwise, $|\cdot|$ is
said to be non-archimedian, which means that there is a positive real number
$A$ such that

(1.58) $|n \cdot 1| \le A$

for every $n \in \mathbf{Z}_+$. Equivalently, $|\cdot|$ is archimedian if
$|n \cdot 1| > 1$ for some $n \in \mathbf{Z}_+$, and $|\cdot|$ is
non-archimedian if (1.58) holds with $A = 1$, by the remarks in the preceding
paragraph. Ultrametric absolute value functions are obviously non-archimedian,
as in (1.55). Conversely, if an absolute value function $|\cdot|$ on $k$
satisfies (1.55) for every $n \in \mathbf{Z}_+$, then it can be shown that
$|\cdot|$ is an ultrametric absolute value function on $k$. See Lemma 1.5 on
p16 of [2], or Theorem 2.2.2 on p28 of [12]. This also works for quasimetric
absolute value functions, using an analogous argument, or by reducing to the
case of ordinary absolute value functions, as mentioned at the end of the
preceding section.

If $k$ has positive characteristic, then there are only finitely many elements
of $k$ of the form $n \cdot 1$ for some $n \in \mathbf{Z}_+$. This implies that
any absolute value function on $k$ is non-archimedian, and hence an ultrametric
absolute value function.
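The finiteness claim in positive characteristic can be seen in the simplest
case, the field of integers modulo a prime $p$. The sketch below models that
field with ordinary integer arithmetic modulo $p$ (a modelling choice made here,
not something taken from the text) and collects the elements $n \cdot 1$
obtained by repeatedly adding 1.

```python
def multiples_of_one(p, n_max):
    """Elements n·1 of Z/pZ for 1 <= n <= n_max, with n·1 the sum of n ones."""
    seen = set()
    total = 0
    for _ in range(n_max):
        total = (total + 1) % p   # add the multiplicative identity once more
        seen.add(total)
    return seen

print(len(multiples_of_one(7, 1000)))           # 7
print(multiples_of_one(7, 1000) == set(range(7)))  # True
```

Since only finitely many values $|n \cdot 1|$ occur, they are bounded, which is
exactly the non-archimedian condition (1.58).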





1.7 Topological equivalence 

Let $k$ be a field, and let $|\cdot|_1$ and $|\cdot|_2$ be absolute value
functions on $k$. Also let

(1.59) $d_1(x, y) = |x - y|_1$ and $d_2(x, y) = |x - y|_2$

be the corresponding metrics on $k$, as in (1.35) in Section 1.3. Let us say
that $|\cdot|_1$ and $|\cdot|_2$ are equivalent as absolute value functions on
$k$ if there is a positive real number $a$ such that

(1.60) $|x|_2 = |x|_1^a$

for every $x \in k$. Of course, this implies that

(1.61) $d_2(x, y) = d_1(x, y)^a$

for every $x, y \in k$, and hence that $d_1(x, y)$ and $d_2(x, y)$ determine
the same topology on $k$. Conversely, if $|\cdot|_1$ and $|\cdot|_2$ are
topologically equivalent, in the sense that $d_1(x, y)$ and $d_2(x, y)$
determine the same topology on $k$, then it can be shown that $|\cdot|_1$ and
$|\cdot|_2$ are equivalent in this sense. See Lemma 3.2 on p20 of [2], or Lemma
3.1.2 on p42 of [12]. Part of the proof is to observe that the open unit ball in
$k$ with respect to an absolute value function can be described topologically
as the set of $x \in k$ such that $x^j \to 0$ as $j \to \infty$. Thus
topological equivalence of the absolute value functions implies that they have
the same open unit balls in $k$, and one can show that this implies that the
absolute value functions are equivalent in the sense of (1.60).

Of course, the trivial absolute value function on $k$ corresponds to the
discrete metric on $k$, and hence the discrete topology. Conversely, if the
topology on $k$ determined by the metric associated to an absolute value
function $|\cdot|$ is the discrete topology, then $|\cdot|$ is the trivial
absolute value function on $k$. This follows from the characterization of
topological equivalence mentioned in the preceding paragraph, but one can also
check it more directly. More precisely, if the topology determined by the
metric associated to $|\cdot|$ is discrete, then the open unit ball in $k$ with
respect to $|\cdot|$ contains only 0, because of the topological description of
the open unit ball in the previous paragraph. It is easy to see that this
implies that $|\cdot|$ is the trivial absolute value function on $k$, using
(1.34) in Section 1.3.

Let $|\cdot|_1$ and $|\cdot|_2$ be absolute value functions on $k$ again, and
suppose for the moment that the topology on $k$ determined by $d_1(x, y)$ is at
least as strong as the topology determined by $d_2(x, y)$. This means that
every open set in $k$ with respect to $d_2(x, y)$ is also an open set with
respect to $d_1(x, y)$, and hence that any sequence of elements of $k$ that
converges to 0 with respect to $d_1(x, y)$ also converges to 0 with respect to
$d_2(x, y)$. It follows that the open unit ball in $k$ with respect to
$|\cdot|_1$ is contained in the open unit ball with respect to $|\cdot|_2$, by
the topological description of the open unit ball mentioned earlier. Of course,
this holds automatically when $|\cdot|_1$ is the trivial absolute value
function on $k$.

If $|\cdot|_1$ is not the trivial absolute value function on $k$, and if the
open unit ball in $k$ with respect to $|\cdot|_1$ is contained in the open unit
ball with respect to $|\cdot|_2$, then $|\cdot|_1$ and $|\cdot|_2$ are
equivalent on $k$. This is Lemma 3.1 on p18 of [2]. More precisely, one can show
that the open unit balls in $k$ with respect to $|\cdot|_1$ and $|\cdot|_2$ are
also the same under these conditions, and then the rest of the proof is the same
as before. It follows that $|\cdot|_1$ and $|\cdot|_2$ are equivalent on $k$
when $|\cdot|_1$ is not the trivial absolute value function and the topology on
$k$ determined by $d_1(x, y)$ is at least as strong as the topology determined
by $d_2(x, y)$, by the remarks in the previous paragraph.


1.8 Ostrowski’s theorems 

Let $|\cdot|$ be a nontrivial absolute value function on the field $\mathbf{Q}$
of rational numbers. A famous theorem of Ostrowski states that $|\cdot|$ is
either equivalent to the standard Euclidean absolute value on $\mathbf{Q}$, or
to the p-adic absolute value on $\mathbf{Q}$ for some prime number $p$. See
Theorem 2.1 on p16 of [2], or Theorem 3.1.3 on p44 of [12]. More precisely,
$|\cdot|$ is equivalent to the standard Euclidean absolute value on
$\mathbf{Q}$ exactly when $|\cdot|$ has the archimedian property on
$\mathbf{Q}$. If $|\cdot|$ is a nontrivial absolute value function on
$\mathbf{Q}$ that is non-archimedian, then $|n| \le 1$ for every
$n \in \mathbf{Z}_+$, and $|n| < 1$ for some $n \in \mathbf{Z}_+$. If $p$ is
the smallest positive integer such that $|p| < 1$, then $p > 1$, and it is easy
to see that $p$ has to be a prime number, because of the multiplicative
property of absolute value functions. Under these conditions, one can show that
$|\cdot|$ is equivalent to the p-adic absolute value on $\mathbf{Q}$ for this
prime number $p$.
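The recipe in the non-archimedian case, namely taking the smallest positive
integer $p$ with $|p| < 1$, can be tried out on a known example. The sketch
below builds $|n|_5^{0.7}$ on positive integers, searches for the smallest $n$
with value below 1, and then recovers the exponent $a$ in (1.60) from
$|p| = p^{-a}$. All function names, the floating-point arithmetic, and the
finite search limit are assumptions of this illustration.

```python
import math

def padic_abs_power(p, a):
    """Return the function n -> |n|_p^a on positive integers (float-valued)."""
    def absval(n):
        j = 0
        while n % p == 0:    # count the factors of p in n
            n //= p
            j += 1
        return float(p) ** (-a * j)
    return absval

def smallest_small_integer(absval, search_limit=1000):
    """Smallest positive integer n with absval(n) < 1, or None if not found."""
    for n in range(1, search_limit + 1):
        if absval(n) < 1:
            return n
    return None

def recover_exponent(absval, p):
    """Solve absval(p) = p^(-a) for a, as in (1.60) relative to |.|_p."""
    return -math.log(absval(p)) / math.log(p)

absval = padic_abs_power(5, 0.7)
p = smallest_small_integer(absval)
print(p)                                  # 5
print(round(recover_exponent(absval, p), 6))  # 0.7
```

For the standard absolute value no such integer exists, so the search returns
`None`, in line with the archimedian alternative in the theorem.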

Suppose now that $k$ is a field of characteristic 0, and that $|\cdot|$ is an
absolute value function on $k$. It is well known that there is a natural
embedding of $\mathbf{Q}$ into $k$ under these conditions, so that $|\cdot|$
induces an absolute value function on $\mathbf{Q}$. Note that $|\cdot|$ has the
archimedian property on $k$ if and only if the induced absolute value function
has the archimedian property on $\mathbf{Q}$. In this case, Ostrowski’s theorem
implies that the induced absolute value function on $\mathbf{Q}$ is equivalent
to the standard Euclidean absolute value on $\mathbf{Q}$. If $k$ is also
complete with respect to the metric associated to $|\cdot|$, then the natural
embedding of $\mathbf{Q}$ into $k$ can be extended continuously to an embedding
of $\mathbf{R}$ into $k$.

Similarly, $|\cdot|$ is non-archimedian on $k$ if and only if the induced
absolute value function on $\mathbf{Q}$ is non-archimedian. In this case, if
the induced absolute value function on $\mathbf{Q}$ is also nontrivial, then
Ostrowski’s theorem implies that the induced absolute value function on
$\mathbf{Q}$ is equivalent to the p-adic absolute value for some prime number
$p$. If $k$ is complete with respect to the metric associated to $|\cdot|$,
then the natural embedding of $\mathbf{Q}$ into $k$ can be extended
continuously to an embedding of $\mathbf{Q}_p$ into $k$.

Let $|\cdot|$ be an archimedian absolute value function on a field $k$, so that
$k$ has characteristic 0, as in Section 1.6. If $k$ is complete with respect to
the corresponding metric, then another famous theorem of Ostrowski implies that
$k$ is isomorphic to either the real or complex numbers, where $|\cdot|$
corresponds to an absolute value function on $\mathbf{R}$ or $\mathbf{C}$ that
is equivalent to the standard one. See Theorem 1.1 on p33 of [2]. As before, the
natural embedding of $\mathbf{Q}$ into $k$ extends continuously to an embedding
of $\mathbf{R}$ into $k$ under these conditions, and the first possibility is
that this embedding is surjective. Otherwise, Ostrowski’s theorem implies that
$k$ is isomorphic to the complex numbers, where this embedding of $\mathbf{R}$
into $k$ corresponds exactly to the standard embedding of $\mathbf{R}$ into
$\mathbf{C}$.

1.9 Discrete absolute value functions 

Let $k$ be a field, and let $|\cdot|$ be an absolute value function on $k$. If
$|\cdot|$ is nontrivial on $k$, then there is an $x \in k$ such that $x \ne 0$
and $|x| \ne 1$. More precisely, either there is a $y \in k$ such that
$y \ne 0$ and $|y| < 1$, or there is a $z \in k$ such that $|z| > 1$. In fact
there are both such a $y$ and $z$ in $k$, since each type of element of $k$ can
be obtained from the other by taking the multiplicative inverse. By taking
powers of such elements of $k$, one can get nonzero elements of $k$ whose
absolute value is arbitrarily large or small.

If $|\cdot|$ is any absolute value function on $k$, then

(1.62) $\{|x| : x \in k, \ x \ne 0\}$

is a subgroup of the multiplicative group $\mathbf{R}_+$ of positive real
numbers. Let us say that $|\cdot|$ is discrete on $k$ if there is a positive
real number $\rho < 1$ such that

(1.63) $|x| \le \rho$

for every $x \in k$ with $|x| < 1$. Equivalently, this means that

(1.64) $|x| \ge 1/\rho$

for every $x \in k$ with $|x| > 1$, by applying (1.63) to $1/x$. This is also
the same as saying that 1 is not a limit point of (1.62) with respect to the
standard metric on $\mathbf{R}$. One can check that this implies that (1.62)
has no limit points in $\mathbf{R}_+$, although 0 is a limit point of (1.62) in
$\mathbf{R}$ when $|\cdot|$ is nontrivial, as in the previous paragraph. Of
course, the trivial absolute value function on any field is discrete. If $p$ is
a prime number, then the p-adic absolute value function is discrete on
$k = \mathbf{Q}$ or $\mathbf{Q}_p$, with $\rho = 1/p$.

Suppose for the moment that $|\cdot|$ is an archimedian absolute value function
on $k$. This implies that $k$ has characteristic 0, as in Section 1.6, so that
there is a natural embedding of $\mathbf{Q}$ into $k$. We have also seen that
the induced absolute value function on $\mathbf{Q}$ is archimedian under these
conditions, and hence that the induced absolute value function on $\mathbf{Q}$
is a positive power of the standard absolute value function on $\mathbf{Q}$, by
Ostrowski’s theorem. The standard absolute value function on $\mathbf{Q}$ is
obviously not discrete, which means that $|\cdot|$ is not discrete on $k$. This
shows that discrete absolute value functions are always non-archimedian.

Let I • I be an absolute value function on a field fc again, and put 

(1.65) Pi = sup{|x| : X G fc, |x| < 1}, 

so that 0 < Pi < 1. Thus pi < 1 if and only if | • | is discrete on fc, and pi = 0 
if and only if | • | is the trivial absolute value function on fc. Suppose now that 



1.10. NONNEGATIVE SUMS 


13 


$|\cdot|$ is discrete and nontrivial on $k$, so that $0 < \rho_1 < 1$. It is not too difficult to
check that there is an $x_1 \in k$ such that

(1.66) $|x_1| = \rho_1$

under these conditions. More precisely, $\rho_1$ is an element of the closure of (1.62)
in $\mathbf{R}_+$ when $|\cdot|$ is nontrivial on $k$, and (1.62) has no limit points in $\mathbf{R}_+$ when
$|\cdot|$ is discrete on $k$. This implies that $\rho_1$ is an element of (1.62), so that $\rho_1$ can
be expressed as in (1.66). It follows that

(1.67) $|x_1^j| = |x_1|^j = \rho_1^j$

for each $j \in \mathbf{Z}$. If $w$ is any nonzero element of $k$, then one can also verify that
there is a $j \in \mathbf{Z}$ such that

(1.68) $|w| = \rho_1^j$,

so that (1.62) consists exactly of integer powers of $\rho_1$. Otherwise, if $|w|$ lies
between two successive powers of $\rho_1$, then one can use a suitable power of $x_1$
to reduce to the case where $|w|$ lies strictly between $\rho_1$ and 1, contradicting the
definition of $\rho_1$.
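
As a concrete check of (1.65) through (1.68), the following Python sketch computes the $p$-adic absolute value on a finite sample of nonzero rational numbers. The helper names `padic_abs` and `is_integer_power`, the prime $p = 3$, and the sample itself are illustrative assumptions, not part of the text. On such a sample one expects the largest value below 1 to be $1/p$, and every value to be an integer power of it.

```python
from fractions import Fraction


def padic_abs(x, p):
    """Return |x|_p for a rational x, as an exact Fraction (|0|_p = 0)."""
    x = Fraction(x)
    if x == 0:
        return Fraction(0)
    num, den, v = x.numerator, x.denominator, 0
    while num % p == 0:   # powers of p in the numerator raise the valuation
        num //= p
        v += 1
    while den % p == 0:   # powers of p in the denominator lower it
        den //= p
        v -= 1
    return Fraction(1, p) ** v


def is_integer_power(value, base, max_exponent=50):
    """Check whether value == base**j for some integer j with |j| <= max_exponent."""
    return any(value == base ** j for j in range(-max_exponent, max_exponent + 1))


p = 3
samples = [Fraction(n, d) for n in range(-30, 31) if n != 0 for d in range(1, 31)]
values = {padic_abs(x, p) for x in samples}

# Sample analogue of (1.65): the largest value of |x|_p that is strictly below 1.
rho_1 = max(v for v in values if v < 1)
all_powers = all(is_integer_power(v, rho_1) for v in values)
print(rho_1, all_powers)
```

Exact rational arithmetic is used so that the comparison with powers of `rho_1` is not affected by rounding.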

1.10 Nonnegative sums 

Let X be a nonempty set, and let f(x) be a nonnegative real-valued function 
on X. Of course, if X has only finitely many elements, then the sum 

(1.69) $\sum_{x \in X} f(x)$

can be defined in the usual way. Otherwise, (1.69) is defined as a nonnegative
extended real number to be the supremum of the sums

(1.70) $\sum_{x \in A} f(x)$

over all nonempty finite subsets $A$ of $X$. If the finite sums (1.70) have an upper
bound in $\mathbf{R}$, so that (1.69) is finite, then $f$ is said to be summable on $X$. It
is sometimes convenient to permit sums over the empty set, in which case the 
sum is interpreted as being equal to 0. It will also sometimes be convenient 
to consider sums of nonnegative extended real numbers, which can be handled 
with straightforward adjustments. The main point is that a sum is automatically 
infinite when any of the terms being summed is infinite. 

If $X = \mathbf{Z}_+$, then it is customary to look at the infinite series

(1.71) $\sum_{j=1}^{\infty} f(j)$

as the limit of the corresponding sequence

(1.72) $\sum_{j=1}^{n} f(j)$

of partial sums. Of course, the partial sums (1.72) are monotonically increasing
in $n$ when $f$ is a nonnegative real-valued function on $\mathbf{Z}_+$. If the partial sums
(1.72) are bounded, then they converge to their supremum as a sequence of real
numbers, and otherwise they tend to $+\infty$ as $n \to \infty$ in the usual sense. It is
easy to see that the supremum of the partial sums (1.72) over $n \in \mathbf{Z}_+$ is the
same as the supremum of the sums (1.70) over finite subsets $A$ of $X = \mathbf{Z}_+$, since
any such set $A$ is contained in the set $\{1, 2, 3, \ldots, n\}$ for some $n \in \mathbf{Z}_+$. This
implies that this interpretation of the sum (1.71) is equivalent to (1.69) when
$X = \mathbf{Z}_+$.

If X is any countably infinite set, then one can enumerate the elements of X 
by a sequence to reduce to the case of ordinary infinite series again. Note that 
the resulting value of the sum does not depend on the choice of the enumeration 
of the elements of X. This follows from the fact that any rearrangement of an 
infinite series of nonnegative real numbers has the same sum as the initial series. 
The definition of (1.69) as the supremum of all finite subsums (1.70) is already 
invariant under any permutation of the elements of X, by construction. 

Let $f$ be a nonnegative real-valued summable function on any set $X$, and
let $\epsilon > 0$ be given. Observe that

(1.73) $f(x) \ge \epsilon$

for at most finitely many $x \in X$, since otherwise the sums (1.70) would be
unbounded. More precisely, the number of $x \in X$ such that (1.73) holds is
less than or equal to $1/\epsilon$ times (1.69). It follows that there are only finitely or
countably many $x \in X$ such that $f(x) > 0$, by taking $\epsilon = 1/n$ for each $n \in \mathbf{Z}_+$.
Thus the sum (1.69) can always be reduced to a finite sum or an infinite series
when it is finite.
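
The counting bound in the preceding paragraph can be tried on a small example. The Python sketch below uses a hypothetical function $f(n) = 1/n^2$ on the finite set $\{1, \ldots, 200\}$, which is an assumption made only for illustration, and compares the number of points where $f$ is at least $\epsilon$ with $1/\epsilon$ times the total sum.

```python
from fractions import Fraction

# Hypothetical summable function on X = {1, ..., 200}: f(n) = 1 / n^2.
f = {n: Fraction(1, n * n) for n in range(1, 201)}
total = sum(f.values())          # the sum (1.69), finite here


def count_at_least(f, eps):
    """Number of points x with f(x) >= eps."""
    return sum(1 for value in f.values() if value >= eps)


thresholds = [Fraction(1, 4), Fraction(1, 100), Fraction(1, 10000)]
counts = [count_at_least(f, eps) for eps in thresholds]
bounds = [total / eps for eps in thresholds]   # (1/eps) times (1.69)
bound_holds = all(c <= b for c, b in zip(counts, bounds))
print(counts, bound_holds)
```

The bound is far from sharp in this example, which is typical: it only uses the fact that each counted point contributes at least $\epsilon$ to the sum.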

If $f$, $g$ are nonnegative real-valued functions on any set $X$, then one can
check that

(1.74) $\sum_{x \in X} (f(x) + g(x)) = \sum_{x \in X} f(x) + \sum_{x \in X} g(x)$,

where the right side of (1.74) is interpreted as being $+\infty$ when either of the two
sums is infinite. Similarly,

(1.75) $\sum_{x \in X} a \, f(x) = a \sum_{x \in X} f(x)$

for every nonnegative real-valued function $f$ on $X$ and positive real number
$a$, where the right side of (1.75) is interpreted as being infinite when (1.69) is
infinite. If $a = 0$, then the right side of (1.75) should be interpreted as being
equal to 0, even when $f$ is not summable on $X$.





If $f$ is a nonnegative real-valued function on a set $X$ and $B$, $C$ are disjoint
subsets of $X$, then

(1.76) $\sum_{x \in B \cup C} f(x) = \sum_{x \in B} f(x) + \sum_{x \in C} f(x)$.

This can be verified directly, or derived from (1.74).

Let $f$ be a nonnegative real-valued summable function on a set $X$ again, and
let $\epsilon > 0$ be given. By definition of (1.69), there is a finite set $A(\epsilon) \subseteq X$ such
that

(1.77) $\sum_{x \in X} f(x) < \sum_{x \in A(\epsilon)} f(x) + \epsilon$.

Equivalently, this means that

(1.78) $\sum_{x \in X \setminus A(\epsilon)} f(x) < \epsilon$,

by (1.76).

1.11 Nonnegative sums, continued 

Let $I$ be a nonempty set, and let $E_j$ be a set for each $j \in I$. Suppose that

(1.79) $E_j \cap E_l = \emptyset$

for every $j, l \in I$ with $j \ne l$, and put

(1.80) $E = \bigcup_{j \in I} E_j$.

If $f$ is a nonnegative real-valued function on $E$, then

(1.81) $\sum_{x \in E} f(x) = \sum_{j \in I} \Big( \sum_{x \in E_j} f(x) \Big)$.

More precisely, if

(1.82) $\sum_{x \in E_j} f(x) = +\infty$

for any $j \in I$, then the sum over $I$ on the right side of (1.81) is automatically
interpreted as being infinite, as mentioned in the previous section. Otherwise,
the right side of (1.81) is a sum of nonnegative real numbers, which may still
be infinite.

In order to prove (1.81), let us first observe that

(1.83) $\sum_{j \in I_1} \Big( \sum_{x \in E_j} f(x) \Big) \le \sum_{x \in E} f(x)$

for every finite set $I_1 \subseteq I$, by (1.76). This implies that

(1.84) $\sum_{j \in I} \Big( \sum_{x \in E_j} f(x) \Big) \le \sum_{x \in E} f(x)$,

by taking the supremum over all finite subsets $I_1$ of $I$. To get the opposite
inequality, it suffices to verify that

(1.85) $\sum_{x \in A} f(x) \le \sum_{j \in I} \Big( \sum_{x \in E_j} f(x) \Big)$

for every finite set $A \subseteq E$. Of course, if $A$ is any finite subset of $E$, then there
is a finite set $I_1 \subseteq I$ such that

(1.86) $A \subseteq \bigcup_{j \in I_1} E_j$.

In this case, it is enough to take the sum over $j \in I_1$ on the right side of (1.85).

Now let $Y$ and $Z$ be nonempty sets, and let $f(y, z)$ be a nonnegative real-valued
function on their Cartesian product $Y \times Z$. Thus

(1.87) $\sum_{y \in Y} f(y, z)$

can be defined as a nonnegative extended real number for each $z \in Z$ as before,
and similarly

(1.88) $\sum_{z \in Z} f(y, z)$

can be defined as a nonnegative extended real number for each $y \in Y$. The
iterated sums

(1.89) $\sum_{z \in Z} \Big( \sum_{y \in Y} f(y, z) \Big)$

and

(1.90) $\sum_{y \in Y} \Big( \sum_{z \in Z} f(y, z) \Big)$

are also defined as nonnegative extended real numbers, as well as the sum

(1.91) $\sum_{(y, z) \in Y \times Z} f(y, z)$

taken over $Y \times Z$ directly. Under these conditions, the sums (1.89), (1.90), and
(1.91) are all equal. More precisely, the equality of either (1.89) or (1.90) with
(1.91) follows from (1.81).



Chapter 2 


Norms on vector spaces 


Throughout this chapter, we let $k$ be a field, and $|\cdot|$ be an absolute value
function on $k$.

2.1 Norms and ultranorms 

Let $V$ be a vector space over $k$. A nonnegative real-valued function $N$ on $V$
over $k$ is said to be a norm on $V$ if it satisfies the following three conditions.
First,

(2.1) $N(v) = 0$ if and only if $v = 0$.

Second,

(2.2) $N(t v) = |t| \, N(v)$

for every $t \in k$ and $v \in V$. Third,

(2.3) $N(v + w) \le N(v) + N(w)$

for every $v, w \in V$. If

(2.4) $N(v + w) \le \max(N(v), N(w))$

for every $v, w \in V$, then $N$ is said to be an ultranorm on $V$. Of course, (2.4)
automatically implies (2.3), and it is easy to see that $|\cdot|$ must be an ultrametric
absolute value function on $k$ if there is an ultranorm on $V$ and $V \ne \{0\}$.

If $N$ is a norm on $V$, then

(2.5) $d(v, w) = N(v - w)$

defines a metric on $V$, which is an ultrametric when $N$ is an ultranorm on $V$.
Remember that $|-1| = 1$, as in (1.32) in Section 1.3, which together with (2.2)
implies that (2.5) is symmetric in $v$ and $w$. If $|\cdot|$ is the trivial absolute value
function on $k$, then the trivial ultranorm on $V$ is defined by putting $N(0) = 0$
and

(2.6) $N(v) = 1$ for every $v \in V$ with $v \ne 0$.

It is easy to see that this is an ultranorm on $V$, for which the corresponding
metric (2.5) is the discrete metric.

Let $n$ be a positive integer, and let $k^n$ be the space of $n$-tuples of elements of
$k$, considered as a vector space over $k$ with respect to coordinatewise addition
and scalar multiplication. Note that

(2.7) $N_0(v) = \max(|v_1|, \ldots, |v_n|)$

defines a norm on $k^n$, which is an ultranorm when $|\cdot|$ is an ultrametric absolute
value function on $k$. Let $d_0(v, w)$ be the metric associated to $N_0$ as in (2.5). By
construction, an open or closed ball in $k^n$ with respect to $d_0(v, w)$ is the same
as the Cartesian product of $n$ open or closed balls in $k$ of the same radius with
respect to the metric associated to $|\cdot|$, respectively. In particular, the topology
on $k^n$ determined by $d_0(v, w)$ is the same as the product topology corresponding
to the topology on $k$ determined by the metric associated to $|\cdot|$. If $k$ is complete
with respect to the metric associated to $|\cdot|$, then it is easy to see that $k^n$ is
complete with respect to $d_0(v, w)$. If $|\cdot|$ is the trivial absolute value function
on $k$, then $N_0$ is the trivial ultranorm on $k^n$.
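
To illustrate (2.7) in an ultrametric setting, the following Python sketch takes $k = \mathbf{Q}$ with the $p$-adic absolute value and checks the ultranorm inequality (2.4) and the homogeneity condition (2.2) for the maximum norm on a small sample of vectors in $\mathbf{Q}^2$. The helper names, the prime $p = 5$, and the sample coordinates are assumptions chosen for illustration; a finite check of this kind is only a sanity test, not a proof.

```python
from fractions import Fraction
from itertools import product


def padic_abs(x, p):
    """Return |x|_p for a rational x, as an exact Fraction (|0|_p = 0)."""
    x = Fraction(x)
    if x == 0:
        return Fraction(0)
    num, den, v = x.numerator, x.denominator, 0
    while num % p == 0:
        num //= p
        v += 1
    while den % p == 0:
        den //= p
        v -= 1
    return Fraction(1, p) ** v


def max_norm(v, p):
    """N_0(v) = max(|v_1|_p, ..., |v_n|_p), as in (2.7)."""
    return max(padic_abs(c, p) for c in v)


p = 5
coords = [Fraction(0), Fraction(1), Fraction(-1), Fraction(5), Fraction(10),
          Fraction(1, 5), Fraction(-3, 25), Fraction(7, 2)]
vectors = list(product(coords, repeat=2))      # sample vectors in Q^2

# Ultranorm version (2.4) of the triangle inequality.
ultranorm_ok = all(
    max_norm([a + b for a, b in zip(v, w)], p) <= max(max_norm(v, p), max_norm(w, p))
    for v in vectors for w in vectors
)
# Homogeneity (2.2): N_0(t v) = |t|_p N_0(v).
homogeneous_ok = all(
    max_norm([t * c for c in v], p) == padic_abs(t, p) * max_norm(v, p)
    for t in coords for v in vectors
)
print(ultranorm_ok, homogeneous_ok)
```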

Let $V$ be any vector space over $k$ again, and let $N$ be a norm on $V$. If $V$
is not already complete with respect to the associated metric (2.5), then one
can pass to its completion as a metric space in the usual way, as in Section
1.4. By standard arguments, the vector space operations and the norm $N$ can
be extended to the completion of $V$, in such a way that the completion of $V$
also becomes a vector space over $k$, and so that the extension of the norm $N$
to the completion of $V$ is a norm on the completion of $V$ as a vector space over
$k$ too. Of course, if $k$ is not complete with respect to the metric associated to
the absolute value function $|\cdot|$, then one can pass to its completion as well, as
in Section 1.4. If $V$ is already complete, and $k$ is not complete, then one can
extend scalar multiplication on $V$ to the completion of $k$, so that $V$ becomes a
vector space over the completion of $k$, and so that $N$ is a norm on $V$ as a vector
space over the completion of $k$.


2.2 The supremum norm 

Let $X$ be a nonempty set, and let $(M, d(\cdot, \cdot))$ be a metric space. Remember that
a subset of $M$ is said to be bounded if it is contained in a ball of finite radius,
and that a function $f$ on $X$ with values in $M$ is said to be bounded if $f(X)$ is a
bounded set in $M$. Let $B(X, M)$ be the space of bounded functions on $X$ with
values in $M$. If $f, g \in B(X, M)$, then

(2.8) $d(f(x), g(x))$

is a bounded nonnegative real-valued function on $X$, so that

(2.9) $\sup_{x \in X} d(f(x), g(x))$

is defined as a nonnegative real number. It is well known that (2.9) defines a
metric on the space of bounded functions on $X$ with values in $M$, called the
supremum metric. If $d(\cdot, \cdot)$ is an ultrametric on $M$, then it is easy to see that
the supremum metric is also an ultrametric on $B(X, M)$. If $M$ is complete
with respect to $d(\cdot, \cdot)$, then $B(X, M)$ is complete with respect to the supremum
metric, by standard arguments.

Now let $V$ be a vector space over $k$, and let $N$ be a norm on $V$, so that the
remarks in the preceding paragraph can be applied to $M = V$, with the metric
associated to $N$. Of course, a $V$-valued function $f$ on $X$ is bounded if and only
if $N(f(x))$ is bounded as a nonnegative real-valued function on $X$. Let us use
the notation $\ell^\infty(X, V)$ for the space of bounded $V$-valued functions $f$ on $X$,
which is a vector space over $k$ with respect to pointwise addition and scalar
multiplication. It is easy to see that

(2.10) $\|f\|_\infty = \|f\|_{\ell^\infty(X, V)} = \sup_{x \in X} N(f(x))$

defines a norm on $\ell^\infty(X, V)$, known as the supremum norm, for which the
corresponding metric is the supremum metric. If $N$ is an ultranorm on $V$, then
the supremum norm is an ultranorm on $\ell^\infty(X, V)$ too.

A $V$-valued function $f$ on $X$ is said to vanish at infinity on $X$ if for each
$\epsilon > 0$,

(2.11) $N(f(x)) < \epsilon$

for all but finitely many $x \in X$. It is easy to see that this implies that $f$
is bounded on $X$, by taking $\epsilon = 1$. Thus the collection $c_0(X, V)$ of $V$-valued
functions on $X$ that vanish at infinity is contained in $\ell^\infty(X, V)$, and is in fact
a linear subspace of $\ell^\infty(X, V)$. One can also check that $c_0(X, V)$ is a closed
set in $\ell^\infty(X, V)$ with respect to the supremum metric. Note that $f$ vanishes at
infinity on $X$ if and only if $N(f(x))$ vanishes at infinity on $X$ as a real-valued
function on $X$.

The support of a $V$-valued function $f$ on $X$ is defined to be the subset $\mathop{\rm supp} f$
of $X$ given by

(2.12) $\mathop{\rm supp} f = \{x \in X : f(x) \ne 0\}$.

The collection $c_{00}(X, V)$ of $V$-valued functions $f$ on $X$ such that $\mathop{\rm supp} f$ has
only finitely many elements is a linear subspace of $c_0(X, V)$. If $f$ is a $V$-valued
function on $X$ that vanishes at infinity, then $f$ can be approximated by elements
of $c_{00}(X, V)$ with respect to the supremum norm, so that $c_0(X, V)$ is the same
as the closure of $c_{00}(X, V)$ in $\ell^\infty(X, V)$. Observe that

(2.13) $\mathop{\rm supp} f = \bigcup_{j=1}^{\infty} \{x \in X : N(f(x)) \ge 1/j\}$

has only finitely or countably many elements when $f \in c_0(X, V)$, because

(2.14) $\{x \in X : N(f(x)) \ge 1/j\}$

has only finitely many elements for each positive integer $j$. If $N$ is the trivial
ultranorm on $V$, then a $V$-valued function $f$ on $X$ vanishes at infinity only when
$\mathop{\rm supp} f$ has only finitely many elements.


2.3 $\ell^r$ Norms

Let $V$ be a vector space over $k$ again, and let $N$ be a norm on $V$. Also let $X$
be a nonempty set, and let $r$ be a positive real number. A $V$-valued function $f$
on $X$ is said to be $r$-summable if

(2.15) $N(f(x))^r$

is summable as a nonnegative real-valued function on $X$, as in Section 1.10. If
$f$ is $r$-summable with $r = 1$, then we may simply say that $f$ is summable on $X$.
The space of $V$-valued $r$-summable functions on $X$ is denoted $\ell^r(X, V)$.

If $f$, $g$ are $V$-valued functions on $X$, then

(2.16) $N(f(x) + g(x))^r \le (N(f(x)) + N(g(x)))^r$

for every $r > 0$ and $x \in X$, because of the triangle inequality (2.3) for $N$. This
implies that

(2.17) $N(f(x) + g(x))^r \le N(f(x))^r + N(g(x))^r$

for every $x \in X$ when $r \le 1$, by (1.18) in Section 1.2. Similarly,

(2.18) $N(f(x) + g(x))^r \le 2^{r-1} \big( N(f(x))^r + N(g(x))^r \big)$

for every $x \in X$ when $r \ge 1$, by (1.23) in Section 1.2. In both cases, we can
take the sum over $x \in X$, to get that $f + g$ is $r$-summable when $f$ and $g$ are
both $r$-summable. Of course, if $f(x)$ is $r$-summable on $X$ and $t \in k$, then $t f(x)$
is $r$-summable on $X$ too, so that $\ell^r(X, V)$ is a vector space with respect to
pointwise addition and scalar multiplication.

Put

(2.19) $\|f\|_r = \|f\|_{\ell^r(X, V)} = \Big( \sum_{x \in X} N(f(x))^r \Big)^{1/r}$

for each $f \in \ell^r(X, V)$. Thus $\|f\|_r$ is a nonnegative real number that is equal to
0 exactly when $f = 0$, and

(2.20) $\|t f\|_r = |t| \, \|f\|_r$

for every $t \in k$ and $f \in \ell^r(X, V)$. If $r \ge 1$, then

(2.21) $\|f + g\|_r \le \|f\|_r + \|g\|_r$

for every $f, g \in \ell^r(X, V)$. This is well known when $V = \mathbf{R}$ with the standard
absolute value function, and otherwise one can reduce to that case using (2.16).
If $r \le 1$, then

(2.22) $\|f + g\|_r^r \le \|f\|_r^r + \|g\|_r^r$

for every $f, g \in \ell^r(X, V)$, as one can see by summing (2.17) over $x \in X$.





Suppose for the moment that $N$ is an ultranorm on $V$, and let $f$, $g$ be
$V$-valued functions on $X$ again. In this case, we have that

(2.23) $N(f(x) + g(x))^r \le \max(N(f(x)), N(g(x)))^r$
       $= \max(N(f(x))^r, N(g(x))^r)$
       $\le N(f(x))^r + N(g(x))^r$

for every $r > 0$ and $x \in X$. This implies that (2.22) holds for every $r > 0$ and
$f, g \in \ell^r(X, V)$, by summing (2.23) over $x \in X$. Note that (2.22) automatically
implies (2.21) when $r \ge 1$, by (1.18) in Section 1.2.

It follows from (2.21) that $\|\cdot\|_r$ defines a norm on $\ell^r(X, V)$ when $r \ge 1$,
which determines a metric on $\ell^r(X, V)$ in the usual way. If $r \le 1$, then (2.22)
implies that

(2.24) $\|f - g\|_r^r$

defines a metric on $\ell^r(X, V)$. Note that every $r$-summable $V$-valued function $f$
on $X$ is bounded, with

(2.25) $\|f\|_\infty \le \|f\|_r$.

More precisely, such a function $f$ vanishes at infinity on $X$, so that

(2.26) $\ell^r(X, V) \subseteq c_0(X, V)$

for every $r > 0$. If $V$ is complete with respect to the metric associated to $N$,
then one can check that $\ell^r(X, V)$ is complete with respect to the corresponding
metric for every $r > 0$, by standard arguments.

Suppose that $f \in \ell^q(X, V)$ for some $q > 0$, with $q \le r$. Thus $f$ is bounded
on $X$, as in the preceding paragraph, and

(2.27) $N(f(x))^r \le \|f\|_\infty^{r-q} \, N(f(x))^q \le \|f\|_q^{r-q} \, N(f(x))^q$

for every $x \in X$. This implies that $f$ is also $r$-summable on $X$, with

(2.28) $\|f\|_r^r \le \|f\|_q^{r-q} \, \|f\|_q^q = \|f\|_q^r$,

by summing (2.27) over $x \in X$. Equivalently,

(2.29) $\|f\|_r \le \|f\|_q$

for every $f \in \ell^q(X, V)$ when $q \le r$.
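
The comparisons (2.25) and (2.29) can be checked numerically in the simplest case $V = k = \mathbf{R}$ with the standard absolute value. The Python sketch below does this for one finitely supported function; the sample values, the list of exponents, and the small floating-point tolerance are assumptions made for illustration.

```python
def norm_r(values, r):
    """(sum |f(x)|^r)^(1/r) for a finitely supported real-valued f, as in (2.19)."""
    return sum(abs(v) ** r for v in values) ** (1.0 / r)


def norm_sup(values):
    """Supremum norm (2.10) of a finitely supported function."""
    return max(abs(v) for v in values)


f = [3.0, -4.0, 1.0, 0.5]          # hypothetical values of f on its support
exponents = [0.5, 1.0, 2.0, 4.0]   # increasing values of r
norms = [norm_r(f, r) for r in exponents]

tol = 1e-12
# (2.29): the norms do not increase as the exponent increases.
decreasing = all(norms[i + 1] <= norms[i] + tol for i in range(len(norms) - 1))
# (2.25): the supremum norm is bounded by every l^r norm.
sup_bounded = all(norm_sup(f) <= n + tol for n in norms)
print(norms, decreasing, sup_bounded)
```

Exponents below 1 are included on purpose, since (2.29) does not require $\|\cdot\|_r$ to be a norm.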

Of course, a $V$-valued function on $X$ with finite support is $r$-summable for
every $r > 0$, so that

(2.30) $c_{00}(X, V) \subseteq \ell^r(X, V)$

for every $r > 0$. It is not too difficult to check that $c_{00}(X, V)$ is dense in $\ell^r(X, V)$
for every $r > 0$, with respect to the appropriate metric, as before. The main
point is that if $f \in \ell^r(X, V)$, then for each $\epsilon > 0$ there is a finite set $A(\epsilon) \subseteq X$
such that

(2.31) $\sum_{x \in X \setminus A(\epsilon)} N(f(x))^r < \epsilon$,

as in (1.78) in Section 1.10. If $N$ is the trivial ultranorm on $V$, then every
$r$-summable function on $X$ has finite support.





2.4 Bounded linear mappings 

Let $V$, $W$ be vector spaces over $k$ equipped with norms $N_V$, $N_W$, respectively. A
linear mapping $T$ from $V$ into $W$ is said to be bounded if there is a nonnegative
real number $C$ such that

(2.32) $N_W(T(v)) \le C \, N_V(v)$

for every $v \in V$. This implies that

(2.33) $N_W(T(u) - T(v)) = N_W(T(u - v)) \le C \, N_V(u - v)$

for every $u, v \in V$, and hence that $T$ is uniformly continuous as a mapping from
$V$ into $W$, with respect to the metrics associated to their norms. It is easy to
see that the collection $\mathcal{BL}(V, W)$ of bounded linear mappings from $V$ into $W$ is
a vector space with respect to pointwise addition and scalar multiplication, as
usual.

As a simple class of examples, let $X$ be a nonempty set, and consider the
vector space $c_{00}(X, k)$ of $k$-valued functions with finite support on $X$. Also let
$\|f\|_\infty$ be the corresponding supremum norm on $c_{00}(X, k)$, and let $\|f\|_1$ be the
$\ell^1$ norm on $c_{00}(X, k)$, as in the previous two sections. Thus

(2.34) $\|f\|_\infty \le \|f\|_1$

for every $f \in c_{00}(X, k)$, as in (2.25), which implies that the identity operator $I$
on $c_{00}(X, k)$ is bounded as a linear mapping from $c_{00}(X, k)$ equipped with the $\ell^1$
norm into $c_{00}(X, k)$ equipped with the supremum norm. Of course, there is an
analogous statement for the standard inclusion of $\ell^1(X, k)$ in $c_0(X, k)$. However,
if $X$ has infinitely many elements, then there is no $C < \infty$ such that

(2.35) $\|f\|_1 \le C \, \|f\|_\infty$

for every $f \in c_{00}(X, k)$, which means that the identity operator $I$ is not bounded
as a linear mapping from $c_{00}(X, k)$ equipped with the supremum norm into
$c_{00}(X, k)$ equipped with the $\ell^1$ norm in this case. If $|\cdot|$ is the trivial absolute
value function on $k$, then the corresponding supremum norm $\|f\|_\infty$ on $c_{00}(X, k)$
is the same as the trivial ultranorm on $c_{00}(X, k)$, and $c_{00}(X, k)$ is the same
as $c_0(X, k)$ and $\ell^1(X, k)$. The topology on $c_{00}(X, k)$ determined by the metric
associated to the $\ell^1$ norm is the discrete topology, which is the same as the
topology determined by the metric associated to the supremum norm.
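
The failure of (2.35) on an infinite set can be seen with indicator functions of larger and larger finite sets. The Python sketch below assumes, for illustration only, that $X$ is the set of positive integers and that $k = \mathbf{Q}$ with the standard absolute value; the function names are hypothetical. Since $|1| = 1$ for every absolute value function, the same computation applies to any field.

```python
def indicator(n):
    """Indicator function of {1, ..., n}, stored as a dict on its support."""
    return {x: 1 for x in range(1, n + 1)}


def norm_1(f):
    """l^1 norm of a finitely supported function."""
    return sum(abs(value) for value in f.values())


def norm_sup(f):
    """Supremum norm of a finitely supported function."""
    return max(abs(value) for value in f.values())


sizes = [1, 10, 100, 1000]
# Ratio ||f||_1 / ||f||_sup for the indicator function of {1, ..., n}.
ratios = [norm_1(indicator(n)) / norm_sup(indicator(n)) for n in sizes]
print(ratios)
```

The ratio equals the number of points in the support, so no single constant $C$ can work for all of these functions at once.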

Let $V$ and $W$ be arbitrary vector spaces over $k$ again, equipped with norms
$N_V$ and $N_W$, and let $T$ be a linear mapping from $V$ into $W$. Suppose that there
is a positive real number $r$ and a nonnegative real number $A$ such that

(2.36) $N_W(T(v)) \le A$

for every $v \in V$ with $N_V(v) < r$. In particular, this condition holds when $T$
is continuous at 0 as a mapping from $V$ into $W$, with respect to the topologies
determined by the metrics associated to the corresponding norms. If $|\cdot|$ is not
the trivial absolute value function on $k$, then it is easy to see that $T$ has to be
bounded as a linear mapping from $V$ into $W$. This does not always work when
$|\cdot|$ is the trivial absolute value function on $k$, as in the preceding paragraph.

If $T$ is a bounded linear mapping from $V$ into $W$, then the operator norm
of $T$ is defined by

(2.37) $\|T\|_{op} = \|T\|_{op, VW} = \inf\{C \ge 0 : \text{(2.32) holds}\}$.

It is easy to see that (2.32) holds with $C = \|T\|_{op}$, so that

(2.38) $N_W(T(v)) \le \|T\|_{op} \, N_V(v)$

for every $v \in V$. One can also check that $\|T\|_{op}$ does define a norm on $\mathcal{BL}(V, W)$,
which is an ultranorm when $N_W$ is an ultranorm on $W$. If $W$ is complete with
respect to the metric associated to the norm $N_W$, then one can verify that
$\mathcal{BL}(V, W)$ is complete with respect to the metric associated to the operator
norm, by standard arguments.

In some situations, the operator norm may be defined by

(2.39) $\sup\{N_W(T(v)) : v \in V,\ N_V(v) \le 1\}$,

which is clearly less than or equal to (2.37). Similarly, one might consider

(2.40) $\sup\{r^{-1} N_W(T(v)) : v \in V,\ N_V(v) \le r\}$

for any positive real number $r$, which is also automatically less than or equal
to (2.37). Thus (2.37) is equal to (2.40) for some $r > 0$ when (2.32) holds with
$C$ equal to (2.40). In particular, this happens for every $r > 0$ when $k = \mathbf{R}$
or $\mathbf{C}$ with the standard absolute value function, as one can see using scalar
multiplication.

Suppose that $|\cdot|$ is an absolute value function on a field $k$, and that $|\cdot|$ is
not discrete on $k$, in the sense described in Section 1.9. This means that there
are $t \in k$ such that $|t| \ne 1$ and $|t|$ is as close as one wants to 1. In this case,
one can check that the values of $|\cdot|$ on $k$ are dense in the set of nonnegative
real numbers with respect to the standard metric on the real line, using integer
powers of these elements $t$ of $k$. This permits one to show that (2.37) is equal
to (2.40) for every $r > 0$, using scalar multiplication again.

Let $|\cdot|$ be any absolute value function on a field $k$ again. If for each $v \in V$
there is a $t \in k$ such that

(2.41) $N_V(v) = |t|$,

then the same type of argument can be used to show that (2.37) is equal to
(2.39). Note that (2.40) is always the same as its counterpart with $r$ replaced
by $r \, |t|$ for any $t \in k$ with $t \ne 0$. If $|\cdot|$ is any nontrivial absolute value function
on $k$, then (2.37) is less than or equal to a constant multiple of (2.40) for every
$r > 0$, where the constant depends only on $|\cdot|$. Of course, these types of
arguments using scalar multiplication do not work so well when $|\cdot|$ is the trivial
absolute value function on $k$.





Suppose that $E$ is a dense linear subspace of $V$, with respect to the metric
on V associated to the norm Ny. Also let T be a bounded linear mapping from 
E into W, using the restriction of Ny to E. Thus T is uniformly continuous as 
a mapping from E into W with respect to the corresponding metrics, as before. 
If W is complete, then a well-known fact about metric spaces implies that T 
has a unique extension to a uniformly continuous mapping from V into W. In 
this situation, the extension of T is a bounded linear mapping from V into W, 
with the same operator norm as T has on E. 

2.5 Infinite series 

Let $V$ be a vector space over $k$, and let $N$ be a norm on $V$. As usual, an infinite
series $\sum_{j=1}^{\infty} a_j$ with terms in $V$ is said to converge if the corresponding sequence

(2.42) $s_n = \sum_{j=1}^{n} a_j$

of partial sums converges to an element of $V$. More precisely, this means that
$\{s_n\}_{n=1}^{\infty}$ converges to an element of $V$ with respect to the metric $d(\cdot, \cdot)$ associated
to $N$, as in (2.5) in Section 2.1. In this case, the value of the sum $\sum_{j=1}^{\infty} a_j$ is
defined to be the limit of the sequence $\{s_n\}_{n=1}^{\infty}$.

Note that the sequence $\{s_n\}_{n=1}^{\infty}$ of partial sums is a Cauchy sequence in $V$
with respect to $d(\cdot, \cdot)$ if and only if for each $\epsilon > 0$ there is an $L \ge 1$ such that

(2.43) $N\Big( \sum_{j=l+1}^{n} a_j \Big) < \epsilon$

for every $n > l \ge L$. In particular, this implies that $\{a_j\}_{j=1}^{\infty}$ converges to 0 in
$V$, by taking $n = l + 1$. Of course, $\{s_n\}_{n=1}^{\infty}$ is a Cauchy sequence in $V$ when
$\sum_{j=1}^{\infty} a_j$ converges in $V$, and the converse holds when $V$ is complete with respect
to $d(\cdot, \cdot)$.

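If

(2.44) $\sum_{j=1}^{\infty} N(a_j)$

converges as an infinite series of nonnegative real numbers, then we say that
$\sum_{j=1}^{\infty} a_j$ converges absolutely. It is easy to see that this implies that $\{s_n\}_{n=1}^{\infty}$ is
a Cauchy sequence in $V$, because

(2.45) $N\Big( \sum_{j=l+1}^{n} a_j \Big) \le \sum_{j=l+1}^{n} N(a_j)$

for each $n > l \ge 1$. If $V$ is complete, then it follows that $\sum_{j=1}^{\infty} a_j$ converges in
$V$, in which case we also have that

(2.46) $N\Big( \sum_{j=1}^{\infty} a_j \Big) \le \sum_{j=1}^{\infty} N(a_j)$.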





Similarly, if $N$ is an ultranorm on $V$, then

(2.47) $N\Big( \sum_{j=l+1}^{n} a_j \Big) \le \max_{l+1 \le j \le n} N(a_j)$

for every $n > l \ge 1$. This implies that $\{s_n\}_{n=1}^{\infty}$ is a Cauchy sequence in $V$ when
$\{a_j\}_{j=1}^{\infty}$ converges to 0. If $V$ is complete, then it follows that $\sum_{j=1}^{\infty} a_j$ converges
in $V$ under these conditions, and that

(2.48) $N\Big( \sum_{j=1}^{\infty} a_j \Big) \le \max_{j \ge 1} N(a_j)$.

Note that the maximum on the right side of (2.48) is attained when $N(a_j) \to 0$
as $j \to \infty$, as a sequence of nonnegative real numbers.
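
A standard example of this ultrametric behavior is the series $\sum_{j=1}^{\infty} p^j$ with respect to the $p$-adic absolute value, whose terms tend to 0. The Python sketch below, with the hypothetical helper `padic_abs` and the choice $p = 2$ as assumptions, measures the $p$-adic distance from the partial sums to the rational number $p/(1 - p)$, and checks an instance of (2.47) on one block of consecutive terms.

```python
from fractions import Fraction


def padic_abs(x, p):
    """Return |x|_p for a rational x, as an exact Fraction (|0|_p = 0)."""
    x = Fraction(x)
    if x == 0:
        return Fraction(0)
    num, den, v = x.numerator, x.denominator, 0
    while num % p == 0:
        num //= p
        v += 1
    while den % p == 0:
        den //= p
        v -= 1
    return Fraction(1, p) ** v


p = 2
terms = [Fraction(p) ** j for j in range(1, 21)]      # a_j = p^j for j = 1, ..., 20
limit = Fraction(p, 1 - p)                            # candidate sum p / (1 - p)

partial_sums, s = [], Fraction(0)
for a in terms:
    s += a
    partial_sums.append(s)

# p-adic distance from s_n to the candidate sum, for n = 1, ..., 20.
errors = [padic_abs(s_n - limit, p) for s_n in partial_sums]

# (2.47): a block of consecutive terms is no larger than its largest term.
block = terms[4:12]
block_ok = padic_abs(sum(block), p) <= max(padic_abs(a, p) for a in block)
print(errors[:4], block_ok)
```

The partial sums grow without bound in the standard absolute value, so the convergence seen here is specific to the $p$-adic metric.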


2.6 Generalized convergence 

Let $V$ be a vector space over $k$ again, equipped with a norm $N$. Also let $X$ be
a nonempty set, and let $f$ be a function on $X$ with values in $V$. If $A$ is a finite
subset of $X$, then the sum

(2.49) $\sum_{x \in A} f(x)$

can be defined in the usual way, where (2.49) is interpreted as being equal to 0
when $A = \emptyset$. Of course, the collection of finite subsets of $X$ is partially ordered
by inclusion. In fact, the collection of finite subsets of $X$ is a directed system,
because any two finite subsets $A_1$, $A_2$ of $X$ are contained in the finite subset
$A_1 \cup A_2$ of $X$. Thus the family of finite sums (2.49) may be considered as a
net of elements of $V$ indexed by the collection of finite subsets of $X$, and so the
convergence of the sum

(2.50) $\sum_{x \in X} f(x)$

may be defined in terms of the convergence of this net in $V$. More precisely,
this net converges to an element $v$ of $V$ if for every $\epsilon > 0$ there is a finite set
$A(\epsilon) \subseteq X$ such that

(2.51) $N\Big( \sum_{x \in A} f(x) - v \Big) < \epsilon$

for every finite set $A \subseteq X$ such that $A(\epsilon) \subseteq A$. It is easy to see that the limit $v$
of this net is unique when it exists, in which case the value of the sum (2.50) is
defined to be $v$. If $X$ has only finitely many elements, then this reduces to the
usual definition of the sum (2.50).

Similarly, the net of finite sums (2.49) is a Cauchy net in $V$ if for each $\epsilon > 0$
there is a finite set $A_1(\epsilon) \subseteq X$ such that

(2.52) $N\Big( \sum_{x \in A} f(x) - \sum_{x \in A'} f(x) \Big) < \epsilon$

for any two finite sets $A, A' \subseteq X$ such that $A_1(\epsilon) \subseteq A, A'$. If the net of finite
sums (2.49) converges in $V$, then it is easy to see that it is a Cauchy net. This
follows from the triangle inequality, with $A_1(\epsilon)$ taken to be the set $A(\epsilon/2)$ in
the definition of convergence of the net. If $N$ is an ultranorm on $V$, then one
can take $A_1(\epsilon) = A(\epsilon)$.

As a variant of this, let us say that the sum (2.50) satisfies the generalized
Cauchy criterion if for each $\epsilon > 0$ there is a finite subset $A_0(\epsilon)$ of $X$ such that

(2.53) $N\Big( \sum_{x \in B} f(x) \Big) < \epsilon$

for every finite set $B \subseteq X$ that satisfies $A_0(\epsilon) \cap B = \emptyset$. If the net of finite sums
(2.49) is a Cauchy net, then the sum (2.50) satisfies the generalized Cauchy
criterion, with $A_0(\epsilon) = A_1(\epsilon)$ for each $\epsilon > 0$. More precisely, if $B \subseteq X$ is
a finite set such that $A_1(\epsilon) \cap B = \emptyset$, then we can take $A = A_1(\epsilon) \cup B$ and
$A' = A_1(\epsilon)$ in (2.52) to get (2.53). Conversely, if the sum (2.50) satisfies the
generalized Cauchy criterion, then the net of finite sums (2.49) is a Cauchy net,
with $A_1(\epsilon) = A_0(\epsilon/2)$ for each $\epsilon > 0$. To see this, let $\epsilon > 0$ be given, and let
$A, A' \subseteq X$ be finite sets such that $A_0(\epsilon/2) \subseteq A, A'$. Put $B = A \setminus (A \cap A')$ and
$B' = A' \setminus (A \cap A')$, so that

(2.54) $\sum_{x \in A} f(x) - \sum_{x \in A'} f(x) = \sum_{x \in B} f(x) - \sum_{x \in B'} f(x)$

and hence

(2.55) $N\Big( \sum_{x \in A} f(x) - \sum_{x \in A'} f(x) \Big) \le N\Big( \sum_{x \in B} f(x) \Big) + N\Big( \sum_{x \in B'} f(x) \Big)$.

By hypothesis, both terms on the right side of (2.55) are less than $\epsilon/2$, since $B$
and $B'$ are disjoint from $A_0(\epsilon/2)$. This implies that (2.52) holds, as desired. If
$N$ is an ultranorm on $V$, then one can take $A_1(\epsilon) = A_0(\epsilon)$ for every $\epsilon > 0$, by
an analogous argument.

Suppose that $f$ is summable as a $V$-valued function on $X$, as in Section
2.3, and let us check that the sum (2.50) satisfies the generalized Cauchy criterion.
Remember that the summability of $f$ on $X$ means that $N(f(x))$ is summable as
a nonnegative real-valued function on $X$, as in Section 1.10. This implies that
for each $\epsilon > 0$ there is a finite subset $A_0(\epsilon)$ of $X$ such that

(2.56) $\sum_{x \in X \setminus A_0(\epsilon)} N(f(x)) < \epsilon$,

as in (1.78) in Section 1.10. If $B$ is a finite subset of $X \setminus A_0(\epsilon)$, then it follows
that

(2.57) $N\Big( \sum_{x \in B} f(x) \Big) \le \sum_{x \in B} N(f(x)) \le \sum_{x \in X \setminus A_0(\epsilon)} N(f(x)) < \epsilon$,

as desired. Similarly, if $N$ is an ultranorm on $V$, and if $f$ is a $V$-valued function
on $X$ that vanishes at infinity, then the sum (2.50) satisfies the generalized
Cauchy criterion. In this case, we can simply take

(2.58) $A_0(\epsilon) = \{x \in X : N(f(x)) \ge \epsilon\}$

for each $\epsilon > 0$, which has only finitely many elements by hypothesis. If $B$ is a
finite subset of $X \setminus A_0(\epsilon)$, then

(2.59) $N\Big( \sum_{x \in B} f(x) \Big) \le \max_{x \in B} N(f(x)) < \epsilon$,

as desired.
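
The ultranorm case of the generalized Cauchy criterion can be tested by brute force on a small example. In the Python sketch below, the index set is truncated to a finite set so that every subset can be enumerated, and $f(x) = p^x$ is measured with the $p$-adic absolute value; the truncation, the prime $p = 3$, the threshold, and the helper names are all assumptions made for illustration. The exceptional set is taken to be the set of points where $|f(x)|_p$ is at least $\epsilon$, and the largest value of $|\sum_{x \in B} f(x)|_p$ over nonempty sets $B$ avoiding it is computed directly.

```python
from fractions import Fraction
from itertools import combinations


def padic_abs(x, p):
    """Return |x|_p for a rational x, as an exact Fraction (|0|_p = 0)."""
    x = Fraction(x)
    if x == 0:
        return Fraction(0)
    num, den, v = x.numerator, x.denominator, 0
    while num % p == 0:
        num //= p
        v += 1
    while den % p == 0:
        den //= p
        v -= 1
    return Fraction(1, p) ** v


p = 3
X = range(12)                                   # hypothetical finite index set
f = {x: Fraction(p) ** x for x in X}            # |f(x)|_p = p^(-x)

eps = Fraction(1, 27)
# Exceptional set: the points where |f(x)|_p >= eps.
A0 = {x for x in X if padic_abs(f[x], p) >= eps}
rest = [x for x in X if x not in A0]

# Largest |sum over B|_p over all nonempty B contained in the complement of A0.
worst = max(
    padic_abs(sum(f[x] for x in B), p)
    for size in range(1, len(rest) + 1)
    for B in combinations(rest, size)
)
print(sorted(A0), worst, worst < eps)
```

The worst case is attained by a single point, which reflects the fact that an ultranorm of a finite sum never exceeds the largest norm among its terms.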


2.7 Generalized convergence, continued 

Let $V$ be a vector space over $k$ equipped with a norm $N$ again, and let $f$ be a
$V$-valued function on a nonempty set $X$ such that the sum (2.50) satisfies the
generalized Cauchy criterion. This implies that $f$ vanishes at infinity on $X$, by
applying (2.53) to sets $B$ with exactly one element. In particular, it follows that
the support of $f$ has only finitely or countably many elements, as in Section 2.2.
Of course, if the support of $f$ has only finitely many elements, then the sum
(2.50) can be defined in the usual way.

Otherwise, let $\{x_j\}_{j=1}^{\infty}$ be a sequence of distinct elements of $X$ such that
every element of the support of $f$ is of the form $x_j$ for some $j$. It is easy to see
that the infinite series

(2.60) $\sum_{j=1}^{\infty} f(x_j)$

satisfies the usual Cauchy criterion under these conditions, as in Section 2.5,
so that the partial sums of this series form a Cauchy sequence in $V$. If $V$ is
complete with respect to the metric associated to $N$, then it follows that (2.60)
converges as an infinite series in $V$. Using the generalized Cauchy criterion for
the sum (2.50) again, one can check that the net of finite sums (2.49) converges
in $V$ to the same value as the sum (2.60), as in Section 2.6.

If $f$ is summable on $X$, then (2.50) satisfies the generalized Cauchy criterion,
as in the previous section. In this case, it is easy to see that

(2.61) $N\Big( \sum_{x \in X} f(x) \Big) \le \sum_{x \in X} N(f(x))$

when $V$ is complete with respect to the metric associated to $N$, so that the net
of finite sums (2.49) converges in $V$. Similarly, if $N$ is an ultranorm on $V$ and $f$
vanishes at infinity on $X$, then we have seen that (2.50) satisfies the generalized
Cauchy criterion again. If $V$ is also complete with respect to the ultrametric
associated to $N$, so that (2.50) is defined as an element of $V$, then we have that

(2.62) $N\Big( \sum_{x \in X} f(x) \Big) \le \max_{x \in X} N(f(x))$.

Note that the maximum on the right side of (2.62) is attained when $f$ vanishes
at infinity on $X$.

If $f$ is a nonnegative real-valued function on $X$ which is summable, then it is
easy to see that the net of finite sums (2.49) converges to their supremum. Thus
the definition of the sum (2.50) in Section 1.10 is equivalent to the definition of
the sum in Section 2.6 in this situation. Similarly, if $f$ is a nonnegative real-valued
function on $X$ that is not summable, then the finite sums (2.49) tend to
$+\infty$ in a suitable sense. If $f$ is a real or complex-valued summable function on
$X$, then $f$ can be expressed as a linear combination of summable nonnegative
real-valued functions on $X$. This gives another way to look at the convergence
of the net of finite sums (2.49) in this case.

An infinite series with terms in $V$ may be considered as a sum over $X = \mathbf{Z}_+$,
to which the earlier discussion applies. If the corresponding net of all finite
subsums of such a sum over $\mathbf{Z}_+$ converges in $V$, then the partial sums of any
rearrangement of the series converge to the same value. Similarly, if a sum over
$\mathbf{Z}_+$ satisfies the generalized Cauchy criterion, as in Section 2.6, then the sequence
of partial sums of any rearrangement of the series is a Cauchy sequence. In this
case, the convergence of any of these Cauchy sequences implies the convergence
of the whole net of finite sums to the same value, as before.

Let $X$ be any nonempty set again, and let $f$ be a $V$-valued function on $X$
such that the sum (2.50) satisfies the generalized Cauchy criterion, as in the
previous section. If $E$ is any nonempty subset of $X$, then it is easy to see that
the restriction of $f$ to $E$ has the same property, so that

(2.63) $\sum_{x \in E} f(x)$

satisfies the generalized Cauchy criterion as a sum over $E$. If $V$ is complete with
respect to the metric associated to $N$, then it follows that the net of all finite
subsums of (2.63) converges to an element of $V$ for each $E \subseteq X$, as before. If
$E_1$ and $E_2$ are disjoint subsets of $X$, then one can also check that

(2.64) $\sum_{x \in E_1 \cup E_2} f(x) = \sum_{x \in E_1} f(x) + \sum_{x \in E_2} f(x)$

under these conditions.

Suppose now that f is a V-valued function on a nonempty set X such that
the sum (2.50) does not satisfy the generalized Cauchy criterion. This means
that there is an ε > 0 such that for each finite set A ⊆ X there is a finite set
B ⊆ X \ A that satisfies

(2.65)    N\Big(\sum_{x \in B} f(x)\Big) \ge \epsilon.

If X = Z_+, then one can use this to find a rearrangement of the infinite series
\sum_{j=1}^\infty f(j) for which the corresponding sequence of partial sums does not form
a Cauchy sequence. Alternatively, if X = Z_+, then one can use this to find a
strictly increasing sequence {j_l}_{l=1}^\infty of positive integers such that the sequence
of partial sums

(2.66)    \sum_{l=1}^n f(j_l)

does not form a Cauchy sequence in V.

2.8 Bounded finite sums 

Let V be a vector space over k with a norm N again, and let X be a nonempty
set. Let us say that a V-valued function f on X has bounded finite sums if the
sums

(2.67)    \sum_{x \in A} f(x)

over all finite subsets A of X have bounded norm in V. More precisely, this
means that there is a nonnegative real number C depending on f such that

(2.68)    N\Big(\sum_{x \in A} f(x)\Big) \le C

for every finite set A ⊆ X. It is easy to see that the space BFS(X, V) of V-valued
functions on X with bounded finite sums is a vector space with respect
to pointwise addition and scalar multiplication, and that

(2.69)    \|f\|_{BFS} = \|f\|_{BFS(X,V)}
                      = \sup \Big\{ N\Big(\sum_{x \in A} f(x)\Big) : A \text{ is a finite subset of } X \Big\}

defines a norm on BFS(X, V). In fact, BFS(X, V) is a linear subspace of
ℓ^∞(X, V), and

(2.70)    \|f\|_\infty \le \|f\|_{BFS}

for every f ∈ BFS(X, V), as one can see by restricting one's attention to
subsets A of X with exactly one element. If N is an ultranorm on V, then every
bounded V-valued function on X has bounded finite sums, so that BFS(X, V)
is the same as ℓ^∞(X, V), and

(2.71)    \|f\|_\infty = \|f\|_{BFS}

for every f ∈ ℓ^∞(X, V). If V is complete with respect to the metric associated
to N, then one can check that BFS(X, V) is complete with respect to the metric
associated to the BFS norm, by standard arguments.

Let f be a V-valued function on X for which the sum

(2.72)    \sum_{x \in X} f(x)

satisfies the generalized Cauchy criterion, and let us check that f has bounded
finite sums. To do this, let A_0(1) be a finite subset of X such that (2.53) in
Section 2.6 holds with ε = 1 for every finite set B ⊆ X that is disjoint from
A_0(1). If A is any finite subset of X, then

(2.73)    \sum_{x \in A} f(x) = \sum_{x \in A \cap A_0(1)} f(x) + \sum_{x \in A \setminus A_0(1)} f(x),

so that

(2.74)    N\Big(\sum_{x \in A} f(x)\Big) \le N\Big(\sum_{x \in A \cap A_0(1)} f(x)\Big) + N\Big(\sum_{x \in A \setminus A_0(1)} f(x)\Big).

Because A \ A_0(1) is disjoint from A_0(1), we get that

(2.75)    N\Big(\sum_{x \in A} f(x)\Big) \le \sum_{x \in A_0(1)} N(f(x)) + 1,

using the triangle inequality to estimate the first term on the right side of (2.74).
This shows that the sums (2.49) have bounded norm, and of course one could
get a better estimate when N is an ultranorm on V.
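As a sketch of the sharper bound mentioned here, assume that (2.53) has the form N(\sum_{x \in B} f(x)) < ε for finite sets B disjoint from A_0(ε); its statement lies outside this section, so this form is an assumption. Applying the ultrametric version of the triangle inequality to (2.73), and then again to the sum over A ∩ A_0(1), would give the following, where a maximum over the empty set is interpreted as 0:

```latex
N\Big(\sum_{x \in A} f(x)\Big)
  \le \max\Big( N\Big(\sum_{x \in A \cap A_0(1)} f(x)\Big),\,
               N\Big(\sum_{x \in A \setminus A_0(1)} f(x)\Big) \Big)
  \le \max\Big( \max_{x \in A_0(1)} N(f(x)),\, 1 \Big).
```

The sum over A_0(1) in (2.75) is thus replaced by a maximum in the ultranorm case.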

Let GCC(X, V) be the space of V-valued functions f on X such that (2.72)
satisfies the generalized Cauchy criterion, as in Section 2.6. The argument in
the preceding paragraph implies that

(2.76)    GCC(X, V) \subseteq BFS(X, V),

and it is easy to see that GCC(X, V) is a linear subspace of BFS(X, V). One
can also check that GCC(X, V) is a closed set in BFS(X, V), with respect to
the metric on BFS(X, V) associated to the BFS norm. Of course,

(2.77)    c_{00}(X, V) \subseteq GCC(X, V),

and in fact GCC(X, V) is the same as the closure of c_{00}(X, V) in BFS(X, V)
with respect to the BFS norm. Note that

(2.78)    GCC(X, V) \subseteq c_0(X, V),

as mentioned at the beginning of Section 2.7, and that

(2.79)    GCC(X, V) = c_0(X, V)

when N is an ultranorm on V, as indicated near the end of Section 2.6.

If N is any norm on V and f is a V-valued summable function on X, then

(2.80)    N\Big(\sum_{x \in A} f(x)\Big) \le \sum_{x \in A} N(f(x)) \le \|f\|_1

for every finite set A ⊆ X. Thus f has bounded finite sums on X, and

(2.81)    \|f\|_{BFS} \le \|f\|_1.

More precisely,

(2.82)    \ell^1(X, V) \subseteq GCC(X, V),

as in Section 2.6. Alternatively, we have seen that c_{00}(X, V) is dense in ℓ^1(X, V)
with respect to the ℓ^1 norm, as in Section 2.3. This implies that every V-valued
summable function f on X can be approximated by functions with finite support
with respect to the BFS norm, by (2.81), so that f ∈ GCC(X, V), as in the
preceding paragraph.

Let f be a real-valued function on X with bounded finite sums with respect
to the standard absolute value function on R, so that

(2.83)    \Big|\sum_{x \in A} f(x)\Big| \le C

for some nonnegative real number C and every finite set A ⊆ X. This implies
that

(2.84)    \sum_{x \in A} |f(x)| \le 2C

for every finite set A ⊆ X, by applying (2.83) to the subsets of A consisting
of x ∈ A such that f(x) ≥ 0 or f(x) ≤ 0, respectively. It follows that f is a
summable function on X under these conditions, with

(2.85)    \sum_{x \in X} |f(x)| \le 2C.

Similarly, if f is a complex-valued function on X with bounded finite sums with
respect to the standard absolute value function on C, then one can apply the
previous remarks to the real and imaginary parts of f, to get that f is summable
on X.
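For a real-valued function on a very small finite set, the norms compared in (2.70), (2.81), and (2.84) can be computed directly. The following is only an illustrative sketch: the sample values and the helper names `bfs_norm`, `sup_norm`, and `l1_norm` are hypothetical, and the brute-force search over all subsets is practical only when X has a handful of elements.

```python
from itertools import combinations

def bfs_norm(values):
    """Brute-force BFS norm: the largest |sum over A| over all subsets A."""
    best = 0  # the empty subset contributes the sum 0
    for size in range(1, len(values) + 1):
        for subset in combinations(values, size):
            best = max(best, abs(sum(subset)))
    return best

def sup_norm(values):
    """Supremum norm: the largest |f(x)|."""
    return max(abs(v) for v in values)

def l1_norm(values):
    """l^1 norm: the sum of all |f(x)|."""
    return sum(abs(v) for v in values)

# Hypothetical sample values of f on a six-point set X.
f = [3, -1, 4, -1, -5, 2]

print(sup_norm(f), bfs_norm(f), l1_norm(f))
```

In this example the BFS norm is attained on the subset where f is positive, which is the mechanism behind the factor 2 in (2.84).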

Let V be a vector space over k with a norm N again, and suppose that V
is complete with respect to the corresponding metric. If f ∈ GCC(X, V), then
the net of finite sums (2.67) converges in V, as in Section 2.7. The value of the
sum (2.72) is defined to be the limit of this net, which satisfies

(2.86)    N\Big(\sum_{x \in X} f(x)\Big) \le \|f\|_{BFS}.

Thus

(2.87)    f \mapsto \sum_{x \in X} f(x)

defines a bounded linear mapping from GCC(X, V) into V, using the restriction
of the BFS norm to GCC(X, V). More precisely, it is easy to see that the
operator norm of (2.87) is equal to 1, by considering V-valued functions f(x)
on X that are equal to 0 at all but one point in X.

Of course, the sum (2.72) can be defined in the usual way when f has finite
support on X, so that (2.87) may be considered initially as a linear mapping from
c_{00}(X, V) into V. In this case, (2.86) follows directly from the definition of the
BFS norm, so that (2.87) is a bounded linear mapping with respect to the BFS
norm on c_{00}(X, V). If V is complete, then this mapping has a unique extension
to a bounded linear mapping from the closure of c_{00}(X, V) in BFS(X, V) into
V, by the remarks at the end of Section 2.4. We have also seen that the closure
of c_{00}(X, V) in BFS(X, V) is the same as GCC(X, V). This gives another way
to look at (2.87) as a bounded linear mapping from GCC(X, V) into V, with
respect to the BFS norm on GCC(X, V).

2.9 Sums of sums 

Let V be a vector space over k equipped with a norm N, and let us suppose
throughout this section that V is complete with respect to the associated metric.
Also let X be a nonempty set, and let f be a V-valued function on X such that

(2.88)    \sum_{x \in X} f(x)

satisfies the generalized Cauchy criterion, as in Section 2.6. If E is any subset
of X, then it follows that

(2.89)    \sum_{x \in E} f(x)

satisfies the generalized Cauchy criterion too, as mentioned in Section 2.7. This
implies that the net of all finite subsums of (2.89) converges in V, because V is
complete, as in Section 2.7 again. We also have that

(2.90)    N\Big(\sum_{x \in E} f(x)\Big) \le \|f\|_{BFS(E,V)} \le \|f\|_{BFS(X,V)}

for every E ⊆ X, as in (2.86) in the previous section. Here ‖f‖_{BFS(X,V)} is the
usual BFS norm of f on X, as in (2.69), and ‖f‖_{BFS(E,V)} refers to the BFS
norm of the restriction of f to E. Of course, if f is summable on X, then the
restriction of f to any set E ⊆ X is summable, and

(2.91)    N\Big(\sum_{x \in E} f(x)\Big) \le \sum_{x \in E} N(f(x)) \le \sum_{x \in X} N(f(x)),

as in (2.61) in Section 2.7. If N is an ultranorm on V, then it suffices to ask that
f vanish at infinity on X, as in Section 2.6, which implies that the restriction
of f to any set E ⊆ X vanishes at infinity on E. In this case, we have that

(2.92)    N\Big(\sum_{x \in E} f(x)\Big) \le \max_{x \in E} N(f(x)) \le \max_{x \in X} N(f(x))

for each E ⊆ X, as in (2.62) in Section 2.7.

Let I be a nonempty set, and let {E_j}_{j ∈ I} be a family of pairwise-disjoint
subsets of X. Thus

(2.93)    a(j) = \sum_{x \in E_j} f(x)

is defined as an element of V for each j ∈ I, as in the preceding paragraph. If
j_1, ..., j_n are finitely many distinct elements of I, then

(2.94)    \sum_{l=1}^n a(j_l) = \sum_{x \in \bigcup_{l=1}^n E_{j_l}} f(x),

as in (2.64) in Section 2.7. It follows that

(2.95)    N\Big(\sum_{l=1}^n a(j_l)\Big) \le \|f\|_{BFS(X,V)},

as in (2.90). This implies that a has bounded finite sums on I, with

(2.96)    \|a\|_{BFS(I,V)} \le \|f\|_{BFS(\bigcup_{j \in I} E_j, V)} \le \|f\|_{BFS(X,V)}.

One can also check that

(2.97)    \sum_{j \in I} a(j)

satisfies the generalized Cauchy criterion under these conditions, using (2.95)
and the analogous property of f. More precisely, one can verify that

(2.98)    \sum_{j \in I} a(j) = \sum_{x \in \bigcup_{j \in I} E_j} f(x),

by considering approximations of the various sums by finite subsums, and where
the right side of (2.98) is defined as in the previous paragraph. If f is summable
on X, then it is easy to see that a is summable on I, with

(2.99)    \|a\|_{\ell^1(I,V)} \le \sum_{j \in I} \sum_{x \in E_j} N(f(x)) \le \|f\|_{\ell^1(X,V)}.

Similarly, if N is an ultranorm on V, and f vanishes at infinity on X, then one
can check directly that a vanishes at infinity on I, and that

(2.100)   \|a\|_{\ell^\infty(I,V)} \le \max_{j \in I} \max_{x \in E_j} N(f(x)) \le \|f\|_{\ell^\infty(X,V)}.

Note that (2.93) defines a linear mapping from f ∈ GCC(X, V) into the
vector space of V-valued functions a on I. More precisely, this is a bounded
linear mapping from GCC(X, V) into BFS(I, V) with respect to the BFS
norms on X and I, by (2.96). If f has finite support in X, then a has finite
support in I, because the E_j's are pairwise disjoint. Using this, it is easy to see
that this mapping sends GCC(X, V) into GCC(I, V), since GCC(X, V) is the
closure of c_{00}(X, V) in BFS(X, V), and similarly for GCC(I, V). Clearly (2.98)
holds when f has finite support in X, which implies that (2.98) holds for every
f ∈ GCC(X, V), since c_{00}(X, V) is dense in GCC(X, V) with respect to the
BFS norm. One can also look at (2.93) as defining a bounded linear mapping
from ℓ^1(X, V) into ℓ^1(I, V), by (2.99). If N is an ultranorm on V, then (2.93)
can be used to initially define a bounded linear mapping from c_0(X, V) into
ℓ^∞(I, V) with respect to the ℓ^∞ norms on X and I, by (2.100). As before, this
mapping sends c_{00}(X, V) into c_{00}(I, V), which implies that it sends c_0(X, V)
into c_0(I, V) in this case.

Suppose now that X = Y × Z is the Cartesian product of two nonempty
sets Y and Z. If f is a V-valued function on X such that (2.88) satisfies the
generalized Cauchy criterion, as before, then

(2.101)   \sum_{y \in Y} f(y, z)

satisfies the generalized Cauchy criterion for each z ∈ Z, and

(2.102)   \sum_{z \in Z} f(y, z)

satisfies the generalized Cauchy criterion for each y ∈ Y. Here f(y, z) refers to
the value of f at x = (y, z) ∈ Y × Z for each y ∈ Y and z ∈ Z, so that (2.101)
and (2.102) are simply sums of f over subsets of Y × Z. Thus the sum (2.101) is
defined as an element of V for each z ∈ Z, because V is complete, and similarly
(2.102) is defined as an element of V for each y ∈ Y. We also have that

(2.103)   \sum_{z \in Z} \Big(\sum_{y \in Y} f(y, z)\Big)

satisfies the generalized Cauchy criterion as a sum over z ∈ Z, and that

(2.104)   \sum_{y \in Y} \Big(\sum_{z \in Z} f(y, z)\Big)

satisfies the generalized Cauchy criterion as a sum over y ∈ Y. Both of these
statements may be considered as instances of the analogous statement for (2.97)
discussed earlier. Using (2.98), we get that (2.103) and (2.104) are both equal
to

(2.105)   \sum_{(y,z) \in Y \times Z} f(y, z).

If f is summable on X, then all of these sums are sums of summable functions on
the corresponding sets, as before. If N is an ultranorm on V and f vanishes at
infinity on X, then one can check directly that these sums are sums of functions
that vanish at infinity on the corresponding sets.


2.10 Finite-dimensional vector spaces 

Let n be a positive integer, so that k^n may be considered as a vector space over
k, as in Section 2.1. Also let W be a vector space over k, and let T be a linear
mapping from k^n into W. The standard basis vectors e(1), ..., e(n) in k^n can
be defined in the usual way, so that the lth coordinate of e(j) is equal to 1 when
j = l, and to 0 otherwise. Thus each v = (v_1, ..., v_n) ∈ k^n may be expressed as

(2.106)   v = \sum_{j=1}^n v_j e(j),

which implies that

(2.107)   T(v) = \sum_{j=1}^n v_j T(e(j)).

Let N_0 be the norm on k^n defined in (2.7), and let N_W be a norm on W.
Observe that

(2.108)   N_W(T(v)) \le \sum_{j=1}^n N_W(v_j T(e(j))) = \sum_{j=1}^n |v_j| N_W(T(e(j)))
                    \le \Big(\sum_{j=1}^n N_W(T(e(j)))\Big) N_0(v)

for every v ∈ k^n, by (2.107). This implies that T is a bounded linear mapping
from k^n into W, with

(2.109)   \|T\|_{op} \le \sum_{j=1}^n N_W(T(e(j))),

as in Section 2.4. If N_W is an ultranorm on W, then we get that

(2.110)   N_W(T(v)) \le \max_{1 \le j \le n} N_W(v_j T(e(j))) = \max_{1 \le j \le n} (|v_j| N_W(T(e(j))))
                    \le \Big(\max_{1 \le j \le n} N_W(T(e(j)))\Big) N_0(v)

for every v ∈ k^n, and hence

(2.111)   \|T\|_{op} \le \max_{1 \le j \le n} N_W(T(e(j))).

More precisely,

(2.112)   \|T\|_{op} = \max_{1 \le j \le n} N_W(T(e(j)))

in this case, because equality holds in (2.110) with v = e(l) for some l. Note
that N_0 satisfies the condition indicated in (2.41) in Section 2.4, which implies
that the operator norm of T can be given as in (2.39) in that section.
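The ultranorm case can be illustrated numerically with k = W = Q and the 5-adic absolute value. The sketch below rests on several assumptions that are not taken from the text: N_0 is taken to be the maximum of the absolute values of the coordinates (the definition in (2.7) lies outside this section), the helper names `padic_abs`, `n0`, and `apply_T` are hypothetical, and the coefficients standing in for T(e(1)), T(e(2)), T(e(3)) are arbitrary sample values.

```python
from fractions import Fraction

def padic_abs(x, p):
    """p-adic absolute value of a rational number x, as an exact Fraction."""
    x = Fraction(x)
    if x == 0:
        return Fraction(0)
    num, den, valuation = x.numerator, x.denominator, 0
    while num % p == 0:      # powers of p in the numerator raise the valuation
        num //= p
        valuation += 1
    while den % p == 0:      # powers of p in the denominator lower it
        den //= p
        valuation -= 1
    return Fraction(1, p) ** valuation

def n0(v, p):
    """Assumed form of N_0: the largest p-adic absolute value of a coordinate."""
    return max(padic_abs(t, p) for t in v)

def apply_T(c, v):
    """T(v) = sum_j v_j T(e(j)), where c[j] plays the role of T(e(j))."""
    return sum(Fraction(cj) * Fraction(vj) for cj, vj in zip(c, v))

p = 5
c = [Fraction(10), Fraction(3, 25), Fraction(7)]    # hypothetical T(e(1)), T(e(2)), T(e(3))
bound = max(padic_abs(cj, p) for cj in c)           # right side of (2.111)

v = [Fraction(1), Fraction(5), Fraction(2)]         # a sample vector
e2 = [Fraction(0), Fraction(1), Fraction(0)]        # standard basis vector e(2)

print(padic_abs(apply_T(c, v), p), bound * n0(v, p))
print(padic_abs(apply_T(c, e2), p), bound)
```

The first line of output compares the two sides of the ultranorm estimate for the sample vector, and the second shows a basis vector at which the bound is attained.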

If N is any norm on k^n, then there is a positive real number C_1 such that

(2.113)   N(v) \le C_1 N_0(v)

for every v ∈ k^n. This follows from (2.108) applied to W = k^n, N_W = N, and T
equal to the identity mapping on k^n. If k is complete with respect to the metric
associated to | · |, then one can show that there is also a positive real number
C_2 such that

(2.114)   N_0(v) \le C_2 N(v)

for every v ∈ k^n. See Lemma 2.1 on p. 116 of [2], or Theorem 5.2.1 on p. 137 of
[12]. This implies that the topology on k^n determined by the metric associated
to N is the same as the topology determined by the metric associated to N_0.

As in Section 2.1, the topology on k^n determined by the metric associated
to N_0 is the same as the product topology corresponding to the topology on
k determined by the metric associated to the absolute value function. If k is
locally compact, then k^n is locally compact with respect to this topology as
well. Of course, if | · | is the trivial absolute value function on k, then N_0 is the
trivial ultranorm on k^n, and the corresponding topologies are discrete. Suppose
for the moment that | · | is not the trivial absolute value function on k, and
that k is locally compact with respect to the topology determined by the metric
associated to | · |. In this case, it is easy to see that all closed balls in k are
compact, so that closed and bounded subsets of k are compact. It follows that
closed balls in k^n with respect to the metric associated to N_0 are compact too,
by Tychonoff's theorem. This implies that closed and bounded subsets of k^n
are compact too. Note that k is complete when k is locally compact, as one can
check using the fact that compact metric spaces are complete.

It is a bit simpler to show (2.114) when | · | is nontrivial on k and k is locally
compact, so that

(2.115)   \{v \in k^n : N_0(v) = 1\}

is a compact subset of k^n. This also uses the fact that N is continuous as a
real-valued function on k^n with respect to the metric associated to N_0, which
can be derived from (2.113). If k is only asked to be complete with respect to
the metric associated to | · |, then one can use induction on n to prove (2.114).
The base case n = 1 is easy, and when n ≥ 2 the induction hypothesis implies
that a condition like (2.114) holds on k^{n-1} × {0}. In particular, it follows that
k^{n-1} × {0} is complete with respect to the metric associated to N, because k
is complete, by hypothesis. This implies that k^{n-1} × {0} is a closed subset of
k^n with respect to the metric associated to N, by standard arguments. If e(n)
is the nth standard basis vector in k^n, as before, then it follows that there is a
positive lower bound for the distances between e(n) and elements of k^{n-1} × {0}
with respect to the metric associated to N. Equivalently, this means that |v_n|
is bounded by a constant multiple of N(v) for each v ∈ k^n. This permits a
condition like (2.114) to be obtained on k^n from an analogous condition on
k^{n-1} × {0}.

2.11 q-Norms

Let V be a vector space over k, and let q be a positive real number. Also let
N be a nonnegative real-valued function on V that satisfies the same positivity
and homogeneity conditions as for a norm, as in (2.1) and (2.2) in Section 2.1.
Let us say that N is a q-norm on V if

(2.116)   N(v + w)^q \le N(v)^q + N(w)^q

for every v, w ∈ V. Of course, (2.116) is the same as the usual triangle inequality
(2.3) when q = 1, so that a 1-norm is the same as a norm. If N is an ultranorm
on V, then

(2.117)   N(v + w)^q \le \max(N(v), N(w))^q \le N(v)^q + N(w)^q

for every v, w ∈ V and q > 0, so that N is a q-norm on V for every q > 0.

Note that (2.116) is equivalent to asking that

(2.118)   N(v + w) \le (N(v)^q + N(w)^q)^{1/q}

for every v, w ∈ V. Clearly

(2.119)   \max(N(v), N(w)) \le (N(v)^q + N(w)^q)^{1/q}

for every v, w ∈ V and q > 0, which is the same as the second step in (2.117).
We also have that

(2.120)   N(v)^q + N(w)^q \le 2 \max(N(v)^q, N(w)^q)

for every v, w ∈ V and q > 0, so that

(2.121)   (N(v)^q + N(w)^q)^{1/q} \le 2^{1/q} \max(N(v), N(w)).

It follows from (2.119) and (2.121) that

(2.122)   \lim_{q \to \infty} (N(v)^q + N(w)^q)^{1/q} = \max(N(v), N(w))

for every v, w ∈ V, since 2^{1/q} → 1 as q → ∞. Thus one might interpret (2.118)
as being the ultrametric version (2.4) of the triangle inequality when q = ∞.

If 0 < q ≤ r < ∞, then

(2.123)   N(v)^r + N(w)^r \le \max(N(v), N(w))^{r-q} (N(v)^q + N(w)^q)

for every v, w ∈ V. This implies that

(2.124)   N(v)^r + N(w)^r \le (N(v)^q + N(w)^q)^{(r-q)/q + 1} = (N(v)^q + N(w)^q)^{r/q}

for every v, w ∈ V, using (2.119) in the first step. Thus

(2.125)   (N(v)^r + N(w)^r)^{1/r} \le (N(v)^q + N(w)^q)^{1/q}

for every v, w ∈ V when q ≤ r, by taking the rth root of both sides of (2.124).
This could also be derived from (1.18) or (1.19) in Section 1.2, or from (2.29)
in Section 2.3. If N is an r-norm on V, then it follows that N is a q-norm on V
too when q ≤ r.
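The estimates (2.119), (2.121), and (2.125) can be checked numerically for particular values. In the sketch below, the numbers a and b are hypothetical stand-ins for N(v) and N(w), the helper name `q_sum` is an assumption, and floating-point arithmetic is used, so the comparisons are only approximate.

```python
def q_sum(a, b, q):
    """(a^q + b^q)^(1/q) for nonnegative reals a, b and q > 0."""
    return (a ** q + b ** q) ** (1.0 / q)

a, b = 3.0, 4.0                      # hypothetical values of N(v) and N(w)
exponents = [0.5, 1.0, 2.0, 8.0, 64.0]
values = [q_sum(a, b, q) for q in exponents]

for q, value in zip(exponents, values):
    # Each value lies between max(a, b) and 2^(1/q) max(a, b),
    # and the values decrease toward max(a, b) as q increases.
    print(q, value, 2 ** (1.0 / q) * max(a, b))
```

The printed values decrease as q grows and approach max(a, b), in line with (2.122).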





Suppose for the moment that

(2.126)   |t + t'|^q \le |t|^q + |t'|^q

for some q > 0 and every t, t' ∈ k, so that |t|^q is also an absolute value function
on k. If N is a q-norm on V, then

(2.127)   N(v)^q

may be considered as a norm on V with respect to |t|^q as an absolute value
function on k. More precisely, the homogeneity condition (2.2) for N with
respect to |t| on k implies that N(v)^q satisfies the analogous condition with
respect to |t|^q on k. Similarly, (2.116) is the same as the standard triangle
inequality for N(v)^q as a norm on V with respect to |t|^q on k.

Suppose now that V ≠ {0}, and that N is a q-norm on V for some q > 0.
Let u be a nonzero element of V, and let t, t' be arbitrary elements of k. If we
apply (2.116) to v = t u and w = t' u, then we get that

(2.128)   |t + t'|^q N(u)^q = N(t u + t' u)^q \le N(t u)^q + N(t' u)^q
                            = |t|^q N(u)^q + |t'|^q N(u)^q,

using also the homogeneity property (2.2) of N with respect to | · |. This shows
that (2.126) holds under these conditions, since N(u) > 0.

If N is a norm on V, then

(2.129)   N(v - w)

defines a metric on V, as in (2.5) in Section 2.1. Similarly, if N is a q-norm on
V, then

(2.130)   N(v - w)^q

defines a metric on V, which is the same as (2.129) when q = 1. If N is a q-norm
on V and |t|^q is an absolute value function on k, then N(v)^q may be considered
as a norm on V with respect to |t|^q on k, as before, and (2.130) is the same as
the metric associated to this norm. Of course, if N is a q-norm on V and q ≥ 1,
then N is a norm on V too, so that (2.129) is a metric on V as well, which
determines the same topology on V as (2.130). If q < 1, then (2.129) is at least
a quasimetric on V, as in Section 1.2.

Suppose that N is a q-norm on V ≠ {0} for some q > 0, and let X be a
nonempty set. Also let f be a V-valued function on X such that

(2.131)   N(f(x))^q

is summable as a nonnegative real-valued function on X, as in Section 1.10.
As before, |t|^q is an absolute value function on k under these conditions, and
N(v)^q may be considered as a norm on V with respect to |t|^q on k. Thus the
summability of (2.131) on X is the same as the summability of f as a V-valued
function on X with respect to N(v)^q as a norm on V with respect to |t|^q on
k. This implies that \sum_{x \in X} f(x) satisfies the generalized Cauchy condition with
respect to N(v)^q as a norm on V with respect to |t|^q on k, as in Section 2.6.



2.12 ℓ^r Norms, continued


Let V ≠ {0} be a vector space over k again, and let N be a q-norm on V for
some q > 0. It follows that |t|^q is an absolute value function on k too, as in
the previous section, and that N(v)^q may be considered as a norm on V with
respect to |t|^q on k. Also let X be a nonempty set, and let r be a positive real
number. As in Section 2.3, a V-valued function f on X is said to be r-summable
with respect to N on V if

(2.132)   N(f(x))^r

is summable as a nonnegative real-valued function on X. Let us denote the
space of r-summable V-valued functions on X by ℓ^r(X, V), as before, or by
ℓ^r_N(X, V), to indicate the role of N.

Of course,

(2.133)   N(f(x))^r = (N(f(x))^q)^{r/q}

for every x ∈ X, so that f is r-summable with respect to N on V if and only if
f is (r/q)-summable with respect to N(v)^q as a norm on V with respect to |t|^q
on k. Thus

(2.134)   \ell^r_N(X, V) = \ell^{r/q}_{N^q}(X, V),

where N^q is considered as a norm on V with respect to |t|^q on k on the right side
of (2.134). The discussion in Section 2.3 implies that the right side of (2.134)
is a vector space with respect to pointwise addition and scalar multiplication of
V-valued functions on X, so that the same conclusion holds for the left side of
(2.134).

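Put

(2.135)   \|f\|_r = \|f\|_{\ell^r(X,V)} = \|f\|_{\ell^r_N(X,V)} = \Big(\sum_{x \in X} N(f(x))^r\Big)^{1/r}

for each f ∈ ℓ^r_N(X, V), as in (2.19). It is easy to see that this satisfies the usual
positivity and homogeneity conditions for a norm, because of the corresponding
properties of N. Note that

(2.136)   \|f\|_{\ell^{r/q}_{N^q}(X,V)} = \Big(\sum_{x \in X} (N(f(x))^q)^{r/q}\Big)^{q/r} = \Big(\sum_{x \in X} N(f(x))^r\Big)^{q/r}

for every f in (2.134), so that

(2.137)   \|f\|_{\ell^{r/q}_{N^q}(X,V)} = \big(\|f\|_{\ell^r_N(X,V)}\big)^q.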

Suppose for the moment that q = 1, so that N is a norm on V. If r ≥ 1,
then (2.135) defines a norm on ℓ^r_N(X, V), by (2.21). Similarly, if 0 < r ≤ 1,
then (2.135) defines an r-norm on ℓ^r_N(X, V), by (2.22). If N is an ultranorm on V,
then (2.22) holds for every r > 0, as mentioned in Section 2.3. This implies that
(2.135) is an r-norm on ℓ^r_N(X, V) for every r > 0 in this case.

Now let N be a q-norm on V for some q > 0 again, so that N(v)^q is a norm
on V with respect to |t|^q on k. It follows that (2.136) is a norm on ℓ^{r/q}_{N^q}(X, V)
when r/q ≥ 1, and that (2.136) is an (r/q)-norm on ℓ^{r/q}_{N^q}(X, V) when r/q ≤ 1,
as in the preceding paragraph. More precisely, we still use |t|^q as the absolute
value function on k for these two statements, but the main point is the version
of the triangle inequality that we get. If r/q ≥ 1, then we have that

(2.138)   \|f + g\|_{\ell^{r/q}_{N^q}(X,V)} \le \|f\|_{\ell^{r/q}_{N^q}(X,V)} + \|g\|_{\ell^{r/q}_{N^q}(X,V)}

for every f, g ∈ ℓ^{r/q}_{N^q}(X, V). If r/q ≤ 1, then

(2.139)   \|f + g\|_{\ell^{r/q}_{N^q}(X,V)}^{r/q} \le \|f\|_{\ell^{r/q}_{N^q}(X,V)}^{r/q} + \|g\|_{\ell^{r/q}_{N^q}(X,V)}^{r/q}

for every f, g ∈ ℓ^{r/q}_{N^q}(X, V).

These two statements can be reformulated in terms of (2.135), using (2.134)
and (2.137). If r ≥ q, then (2.138) implies that

(2.140)   \|f + g\|_{\ell^r_N(X,V)}^q \le \|f\|_{\ell^r_N(X,V)}^q + \|g\|_{\ell^r_N(X,V)}^q

for every f, g ∈ ℓ^r_N(X, V), so that (2.135) defines a q-norm on ℓ^r_N(X, V). If
r ≤ q, then (2.139) implies that

(2.141)   \|f + g\|_{\ell^r_N(X,V)}^r \le \|f\|_{\ell^r_N(X,V)}^r + \|g\|_{\ell^r_N(X,V)}^r

for every f, g ∈ ℓ^r_N(X, V), so that (2.135) defines an r-norm on ℓ^r_N(X, V).
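The passage from (2.138) to (2.140) can be written out as follows, applying (2.137) to f + g, f, and g. This is a sketch of the intermediate computation, using the notation \ell^r_N and \ell^{r/q}_{N^q} for the two spaces in (2.134), and it is not quoted from the text:

```latex
\|f + g\|_{\ell^r_N(X,V)}^q
  = \|f + g\|_{\ell^{r/q}_{N^q}(X,V)}
  \le \|f\|_{\ell^{r/q}_{N^q}(X,V)} + \|g\|_{\ell^{r/q}_{N^q}(X,V)}
  = \|f\|_{\ell^r_N(X,V)}^q + \|g\|_{\ell^r_N(X,V)}^q .
```

The case r ≤ q is analogous: raising both sides of (2.137) to the power r/q turns (2.139) into (2.141).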

Let us take ℓ^∞(X, V) = ℓ^∞_N(X, V) to be the space of V-valued functions f
on X that are bounded, in the sense that

(2.142)   N(f(x))

is bounded as a nonnegative real-valued function on X. Of course, this is the
same as saying that N(f(x))^q is bounded on X, so that (2.134) also holds when
r = ∞. In particular, ℓ^∞_N(X, V) is a vector space with respect to pointwise
addition and scalar multiplication, as in Section 2.2. If we put

(2.143)   \|f\|_\infty = \|f\|_{\ell^\infty(X,V)} = \|f\|_{\ell^\infty_N(X,V)} = \sup_{x \in X} N(f(x))

for every f ∈ ℓ^∞_N(X, V), as in (2.10), then (2.137) holds when r = ∞ too. It is
easy to see that (2.143) defines a q-norm on ℓ^∞_N(X, V) under these conditions,
directly from the definitions, or using (2.137) with r = ∞ to reduce to the case
of norms.



Chapter 3 


Additional examples and results

3.1 Cauchy products 

Let \sum_{j=0}^\infty a_j and \sum_{l=0}^\infty b_l be infinite series with terms in a field k, and put

(3.1)   c_n = \sum_{j=0}^n a_j b_{n-j}

for each nonnegative integer n. The infinite series \sum_{n=0}^\infty c_n is known as the
Cauchy product of the series \sum_{j=0}^\infty a_j and \sum_{l=0}^\infty b_l. It is easy to see that

(3.2)   \sum_{n=0}^\infty c_n = \Big(\sum_{j=0}^\infty a_j\Big) \Big(\sum_{l=0}^\infty b_l\Big)

formally. In particular, this holds when a_j = 0 for all but finitely many j ≥ 0
and b_l = 0 for all but finitely many l ≥ 0, in which case c_n = 0 for all but
finitely many n. We can look at this in terms of the discussion in Section 2.9,
with

(3.3)   X = (Z_+ ∪ {0}) × (Z_+ ∪ {0}).

Put

(3.4)   E_n = \{(j, l) \in X : j + l = n\}

for each nonnegative integer n, so that E_n is a finite set with exactly n + 1
elements for each n ≥ 0, the E_n's are pairwise disjoint, and

(3.5)   X = \bigcup_{n=0}^\infty E_n.

If f ∈ c_{00}(X, k), then it follows that

(3.6)   \sum_{(j,l) \in X} f(j, l) = \sum_{n=0}^\infty \Big(\sum_{(j,l) \in E_n} f(j, l)\Big).

Let f be the k-valued function on X defined by

(3.7)   f(j, l) = a_j b_l

for each j, l ≥ 0, so that

(3.8)   \sum_{(j,l) \in E_n} f(j, l) = c_n

for every n ≥ 0. If a_j = 0 for all but finitely many j, and b_l = 0 for all but
finitely many l, then f ∈ c_{00}(X, k), and

(3.9)   \sum_{(j,l) \in X} f(j, l) = \Big(\sum_{j=0}^\infty a_j\Big) \Big(\sum_{l=0}^\infty b_l\Big).

This corresponds to summing f(j, l) over j and l separately. Thus (3.2) follows
from (3.6), (3.8), and (3.9) under these conditions.
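In the finitely supported case, the regrouping of the pairs (j, l) along the sets where j + l = n can be carried out mechanically. The sketch below uses exact rational arithmetic; the function name `cauchy_product` and the sample coefficients are hypothetical and serve only as an illustration.

```python
from fractions import Fraction

def cauchy_product(a, b):
    """Coefficients c_n = sum_{j=0}^n a_j b_{n-j} for nonempty finite lists a, b."""
    c = [Fraction(0)] * (len(a) + len(b) - 1)
    for j, aj in enumerate(a):
        for l, bl in enumerate(b):
            c[j + l] += aj * bl   # the pair (j, l) contributes to the term with n = j + l
    return c

# Hypothetical finitely supported sequences a_0, a_1, a_2 and b_0, ..., b_3.
a = [Fraction(1), Fraction(1, 2), Fraction(1, 4)]
b = [Fraction(2), Fraction(-1), Fraction(0), Fraction(3)]
c = cauchy_product(a, b)

# Both sides of (3.2) for these sequences.
print(sum(c), sum(a) * sum(b))
```

Each product a_j b_l is added to exactly one c_n, which is why the total over all n agrees with the product of the two sums.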

Suppose for the moment that k = R, and the a_j's and b_l's are nonnegative
real numbers for each j, l ≥ 0. This implies that the c_n's are nonnegative real
numbers for each n ≥ 0, and that each of the three sums in (3.2) is defined
as a nonnegative extended real number. In this case, it is well known and not 
difficult to check that (3.2) always holds, with suitable interpretations when the 
sums are infinite. More precisely, the right side of (3.2) should be interpreted as 
being equal to 0 whenever one of the factors is equal to 0, even if the other factor 
is infinite, and otherwise the right side of (3.2) should be interpreted as being 
infinite when one of the factors is infinite and the other is positive. This may 
be considered as a consequence of the discussion in Section 1.11 for nonnegative 
real-valued functions, using the interpretations just mentioned for the right side 
of (3.9). 

Suppose now that k = R or C, with the standard absolute value function.
Note that

(3.10)   |c_n| \le \sum_{j=0}^n |a_j| |b_{n-j}|

for each n ≥ 0, and hence that

(3.11)   \sum_{n=0}^\infty |c_n| \le \sum_{n=0}^\infty \Big(\sum_{j=0}^n |a_j| |b_{n-j}|\Big) = \Big(\sum_{j=0}^\infty |a_j|\Big) \Big(\sum_{l=0}^\infty |b_l|\Big),

with suitable interpretations for the right side of (3.11), as in the preceding
paragraph. This implies that \sum_{n=0}^\infty c_n converges absolutely when \sum_{j=0}^\infty a_j and
\sum_{l=0}^\infty b_l converge absolutely, in which case one can check that (3.2) holds, by
approximating the various sums by finite sums. This can also be seen as a
consequence of the discussion in Section 2.9 for summable functions, using the
fact that (3.7) is summable on (3.3). Alternatively, one can reduce to the
analogous statement for nonnegative real numbers, by expressing \sum_{j=0}^\infty a_j and
\sum_{l=0}^\infty b_l as linear combinations of convergent series of nonnegative real numbers.

If k is any field with an ultrametric absolute value function | · |, then

(3.12)   |c_n| \le \max_{0 \le j \le n} (|a_j| |b_{n-j}|)

for each n ≥ 0. Using this, it is easy to see that {c_n}_{n=0}^\infty converges to 0 in k
when {a_j}_{j=0}^\infty and {b_l}_{l=0}^\infty converge to 0 in k. If k is complete with respect to
the metric associated to | · |, then it follows that the corresponding infinite series
converge in k. In this situation, one can check that (3.2) holds, by approximating
the various sums by finite sums again. As before, this can also be derived from
the discussion in Section 2.9, using the fact that (3.7) vanishes at infinity on
(3.3) when {a_j}_{j=0}^\infty and {b_l}_{l=0}^\infty converge to 0 in k.

3.2 Formal power series 

Let k_0 be a field, and let T be an indeterminate. By a formal power series in T
with coefficients in k_0 we mean an expression of the form

(3.13)   f(T) = \sum_{j=0}^\infty f_j T^j,

where f_j ∈ k_0 for each nonnegative integer j. The collection of all formal power
series in T with coefficients in k_0 is denoted k_0[[T]], as usual. More precisely,
the elements of k_0[[T]] correspond to sequences {f_j}_{j=0}^\infty of elements of k_0, or
equivalently to functions from the set Z_+ ∪ {0} of nonnegative integers into k_0.
Thus k_0[[T]] may be defined as the collection of such sequences, or equivalently
as the space of k_0-valued functions on Z_+ ∪ {0}. However, it is often more
convenient to represent elements of k_0[[T]] as in (3.13). Note that k_0[[T]] is a
vector space over k_0 with respect to termwise addition and scalar multiplication.

Let f(T) and g(T) be elements of k_0[[T]], where f(T) is as in (3.13), and
similarly

(3.14)   g(T) = \sum_{l=0}^\infty g_l T^l

for some g_l ∈ k_0. The product of f(T) and g(T) can be defined formally in the
usual way, with

(3.15)   T^j T^l = T^{j+l}

for all j, l ≥ 0. This means that

(3.16)   f(T) g(T) = \sum_{j=0}^\infty \sum_{l=0}^\infty f_j g_l T^{j+l},

where there are only finitely many terms involving T^n for each nonnegative
integer n. Collecting these terms, we get that

(3.17)   f(T) g(T) = \sum_{n=0}^\infty (f g)_n T^n,

where

(3.18)   (f g)_n = \sum_{j=0}^n f_j g_{n-j}

for each n ≥ 0. Of course, this is the same as the Cauchy product, discussed in
the previous section. More precisely, one can use (3.17) and (3.18) as the official
definition of multiplication on k_0[[T]], which makes sense directly at the level of
the corresponding sequences of coefficients in k_0. One can also check that this
makes k_0[[T]] into a commutative ring, and in fact an algebra over k_0.

A formal polynomial f(T) in T with coefficients in k_0 may be considered as a
formal power series (3.13) such that f_j = 0 for all but finitely many j. Thus the
collection k_0[T] of formal polynomials in T may be considered as a subalgebra of
k_0[[T]]. Similarly, k_0 may be identified with the subalgebra of k_0[T] consisting
of power series (3.13) such that f_j = 0 for every j ≥ 1. With this identification,
the multiplicative identity element 1 in k_0 corresponds to T^0 in k_0[T], which is
the multiplicative identity element in k_0[[T]].

If f(T) is a nonzero formal power series in T with coefficients in k_0, then let
n(f(T)) be the smallest nonnegative integer n such that

(3.19)   f_n \ne 0.

If g(T) is another nonzero formal power series in T with coefficients in k_0, then
it is easy to see that f(T) g(T) ≠ 0 too, and that

(3.20)   n(f(T) g(T)) = n(f(T)) + n(g(T)).

Let us extend n(f(T)) to the case where f(T) = 0 by putting n(0) = +∞, so
that (3.20) holds with the usual interpretations for every f(T), g(T) ∈ k_0[[T]].
Note that

(3.21)   n(a f(T)) = n(f(T))

for every f(T) ∈ k_0[[T]] and a ∈ k_0 with a ≠ 0, which may be considered as a
special case of (3.20). We also have that

(3.22)   n(f(T) + g(T)) \ge \min(n(f(T)), n(g(T)))

for every f(T), g(T) ∈ k_0[[T]], with the usual interpretations for infinite values
of n(·).

Let r be a positive real number strictly less than 1, and put

(3.23)   |f(T)| = |f(T)|_r = r^{n(f(T))}

for every f(T) ∈ k_0[[T]] with f(T) ≠ 0, and |f(T)| = 0 when f(T) = 0. Thus

(3.24)   |f(T) + g(T)| \le \max(|f(T)|, |g(T)|)

for every f(T), g(T) ∈ k_0[[T]], by (3.22). Similarly,

(3.25)   |f(T) g(T)| = |f(T)| |g(T)|

for every f(T), g(T) ∈ k_0[[T]], by (3.20). In particular,

(3.26)   |a f(T)| = |f(T)|

for every f(T) ∈ k_0[[T]] and a ∈ k_0 with a ≠ 0, as in (3.21). It follows that
|f(T)| defines an ultranorm on k_0[[T]] as a vector space over k_0, using the
trivial absolute value function on k_0.

This implies that

(3.27)    $|f(T) - g(T)|$

defines an ultrametric on $k_0[[T]]$, which determines a topology on $k_0[[T]]$ in
the usual way. As before, $k_0[[T]]$ can be identified with the Cartesian product
of a family of copies of $k_0$ indexed by $\mathbf{Z}_+ \cup \{0\}$. It is easy to see that the
topology on $k_0[[T]]$ determined by (3.27) corresponds to the product topology
on this Cartesian product, using the discrete topology on $k_0$. If $k_0$ has only
finitely many elements, then $k_0[[T]]$ is compact with respect to this topology,
by Tychonoff's theorem. Note that $k_0[T]$ is dense in $k_0[[T]]$ with respect to this
topology, for any $k_0$.

If $a$ is any positive real number, then

(3.28)    $|f(T)|_r^a = |f(T)|_{r^a}$

for every $f(T) \in k_0[[T]]$, by the definition (3.23) of $|f(T)|_r$. The corresponding
ultrametric

(3.29)    $|f(T) - g(T)|_r^a = |f(T) - g(T)|_{r^a}$

determines the same topology on $k_0[[T]]$ for every $a > 0$, as in Section 1.2. Of
course, this also follows from the description of this topology on $k_0[[T]]$ as the
product topology associated to the discrete topology on $k_0$, as in the preceding
paragraph.

Let $\{f_l(T)\}_{l=1}^{\infty}$ be a sequence of elements of $k_0[[T]]$, with

(3.30)    $f_l(T) = \sum_{j=0}^{\infty} f_{j,l}\, T^j$

for each $l \ge 1$, and let $f(T)$ be another element of $k_0[[T]]$, as in (3.13). One can
check that $\{f_l(T)\}_{l=1}^{\infty}$ converges to $f(T)$ with respect to the ultrametric (3.27)
if and only if for each $j \ge 0$ we have that

(3.31)    $f_{j,l} = f_j$

for all sufficiently large $l$, depending on $j$. Similarly, $\{f_l(T)\}_{l=1}^{\infty}$ is a Cauchy
sequence in $k_0[[T]]$ with respect to (3.27) if and only if for each $j \ge 0$, $f_{j,l}$ is
constant in $l$ for sufficiently large $l$, depending on $j$. It follows that every Cauchy
sequence in $k_0[[T]]$ with respect to (3.27) converges to an element of $k_0[[T]]$, so
that $k_0[[T]]$ is complete as a metric space with respect to (3.27).


3.3 Geometric series 


Let $k$ be a field, with an absolute value function $|\cdot|$. It is well known and easy
to check that

(3.32)    $(1 - x) \sum_{j=0}^{n} x^j = 1 - x^{n+1}$

for every $x \in k$ and nonnegative integer $n$, where $x^0$ is interpreted as being
equal to $1$, as usual. This implies that

(3.33)    $\sum_{j=0}^{n} x^j = \frac{1 - x^{n+1}}{1 - x}$

for every $n \ge 0$ when $x \neq 1$, and hence that

(3.34)    $\lim_{n \to \infty} \sum_{j=0}^{n} x^j = \frac{1}{1 - x}$

when $|x| < 1$, because $|x^{n+1}| = |x|^{n+1} \to 0$ as $n \to \infty$. Thus the geometric
series $\sum_{j=0}^{\infty} x^j$ converges in $k$ when $|x| < 1$, with sum equal to $1/(1 - x)$, as
usual.

Consider the case where $k = \mathbf{Q}$, equipped with the $p$-adic absolute value $|\cdot|_p$
for some prime number $p$. If $w \in \mathbf{Z}$, then

(3.35)    $|p\,w|_p \le 1/p < 1$,

and we get that

(3.36)    $\lim_{n \to \infty} \sum_{j=0}^{n} (p\,w)^j = \frac{1}{1 - p\,w}$,

as in (3.34). Of course, the limit in (3.36) is taken with respect to the p-adic 
metric on Q. 

Now let $k_0$ be a field, let $T$ be an indeterminate, and let $k_0[[T]]$ be the algebra
of formal power series in $T$ with coefficients in $k_0$, as in the previous section.
Also let $r \in (0, 1)$ be given, and let $|f(T)|$ be as defined in (3.23). Note that
$a(T) \in k_0[[T]]$ satisfies

(3.37)    $|a(T)| < 1$

if and only if the constant term in $a(T)$ is equal to $0$, which is the same as saying
that

(3.38)    $a(T) = T\,b(T)$

for some $b(T) \in k_0[[T]]$. This implies that

(3.39)    $a(T)^l = T^l\, b(T)^l$

for each positive integer $l$, and hence that the coefficient of $T^j$ in $a(T)^l$ is equal to $0$
when $j < l$. We also have that

(3.40)    $(1 - a(T)) \sum_{l=0}^{n} a(T)^l = 1 - a(T)^{n+1}$

for each nonnegative integer $n$, as in (3.32).

Observe that the coefficient of $T^j$ in

(3.41)    $\sum_{l=0}^{n} a(T)^l = \sum_{l=0}^{n} T^l\, b(T)^l$

does not depend on $n$ when $n \ge j$, so that one can define

(3.42)    $\sum_{l=0}^{\infty} a(T)^l = \sum_{l=0}^{\infty} T^l\, b(T)^l$

as an element of $k_0[[T]]$ in the obvious way. Equivalently, the sequence of partial
sums (3.41) converges in $k_0[[T]]$ with respect to the topology described
in the previous section. One can also check that

(3.43)    $(1 - a(T)) \sum_{l=0}^{\infty} a(T)^l = 1$

under these conditions, using (3.40). Thus $1 - a(T)$ has a multiplicative inverse
in $k_0[[T]]$ when $a(T) \in k_0[[T]]$ satisfies (3.37), which is the same as (3.38).
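The construction of the inverse of $1 - a(T)$ can be sketched computationally by working modulo $T^N$, since $a(T)^l$ contributes nothing below $T^l$ when the constant term of $a(T)$ is $0$. The following Python sketch takes $k_0 = \mathbf{Q}$; the names `multiply` and `geometric_inverse`, and the truncation length `size`, are assumptions made for this illustration only.

```python
from fractions import Fraction


def multiply(f, g, size):
    """Product of the coefficient lists f and g, truncated modulo T**size."""
    out = [Fraction(0)] * size
    for j, fj in enumerate(f[:size]):
        for l, gl in enumerate(g[:size - j]):
            out[j + l] += fj * gl
    return out


def geometric_inverse(a, size):
    """Coefficients of sum_{l >= 0} a(T)^l modulo T**size.

    Requires a[0] == 0, so that a(T)^l has no terms below T^l and only
    the powers with l < size matter, as in the remark about (3.41)."""
    if a[0] != 0:
        raise ValueError("the constant term of a(T) must be 0")
    total = [Fraction(0)] * size
    power = [Fraction(1)] + [Fraction(0)] * (size - 1)  # a(T)^0 = 1
    for _ in range(size):
        total = [s + c for s, c in zip(total, power)]
        power = multiply(power, a, size)
    return total


# a(T) = T + T^2, so that 1 - a(T) = 1 - T - T^2.
a = [0, 1, 1]
inverse = geometric_inverse(a, 6)
one_minus_a = [1, -1, -1]
print(inverse)
print(multiply(one_minus_a, inverse, 6))  # the coefficients of 1 modulo T^6, as in (3.43)
```

For this particular $a(T)$ the coefficients of the inverse are the Fibonacci numbers, which gives an independent check on the output.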

3.4 Formal Laurent series 

Let $k_0$ be a field again, and let $T$ be an indeterminate. By a formal Laurent
series in $T$ with coefficients in $k_0$ we mean an expression of the form

(3.44)    $f(T) = \sum_{j=-\infty}^{\infty} f_j\, T^j$

with $f_j \in k_0$ for each integer $j$. As before, such a formal Laurent series $f(T)$
is supposed to correspond exactly to a doubly-infinite sequence $\{f_j\}_{j=-\infty}^{\infty}$ of
elements of $k_0$, or equivalently to a $k_0$-valued function on the set $\mathbf{Z}$ of integers.
In particular, the space of these series is a vector space over $k_0$ in an obvious
way, with respect to termwise addition and scalar multiplication.

Let $k_0((T))$ be the space of formal Laurent series $f(T)$ in $T$ with coefficients
in $k_0$ such that $f_j = 0$ for all but finitely many negative integers $j$. This is
a linear subspace of the vector space of all formal Laurent series in $T$ with
coefficients in $k_0$, as in the preceding paragraph. An element of $k_0((T))$ may be
expressed as

(3.45)    $f(T) = \sum_{j=n}^{\infty} f_j\, T^j$

for some integer $n$, where it is understood that $f_j = 0$ when $j < n$. It is
sometimes convenient to use the notation

(3.46)    $f(T) = \sum_{j \gg -\infty} f_j\, T^j$

as on p. 27 of [2], to indicate that $f_j = 0$ for all but finitely many negative integers
$j$, without specifying an integer $n$ as in (3.45).

Let $f(T), g(T) \in k_0((T))$ be given, where $f(T)$ is as in (3.46), and similarly

(3.47)    $g(T) = \sum_{l \gg -\infty} g_l\, T^l$

for some $g_l \in k_0$. As in Section 3.2, the product of $f(T)$ and $g(T)$ can be defined
formally by

(3.48)    $f(T)\,g(T) = \sum_{j \gg -\infty} \sum_{l \gg -\infty} f_j\, g_l\, T^{j+l}$,

since there are only finitely many terms involving $T^n$ for any integer $n$. More
precisely, if we collect the terms involving $T^n$, then we get that

(3.49)    $f(T)\,g(T) = \sum_{n \gg -\infty} (f g)_n\, T^n$,

where

(3.50)    $(f g)_n = \sum_{j + l = n} f_j\, g_l$

for each $n \in \mathbf{Z}$. Note that (3.50) is indeed a sum over finitely many $j, l \in \mathbf{Z}$
for every $n \in \mathbf{Z}$, and that (3.50) is equal to $0$ for all but finitely many negative
integers $n$, so that (3.49) is an element of $k_0((T))$. As before, we use (3.49) and
(3.50) as the official definition of multiplication on $k_0((T))$, which makes sense
directly at the level of the corresponding sequences of coefficients in $k_0$.
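The definition (3.50) can be tried out on finitely supported Laurent series, stored as dictionaries from exponents to coefficients. This is only a sketch: the representation and the name `laurent_multiply` are choices made for this example, and general elements of $k_0((T))$ have infinitely many nonzero coefficients, which a finite dictionary cannot hold.

```python
from collections import defaultdict


def laurent_multiply(f, g):
    """Product of two finitely supported Laurent series.

    Each series is a dict {exponent: coefficient}; the coefficient of T^n
    in the product is the sum of f_j * g_l over j + l = n, as in (3.50)."""
    product = defaultdict(int)
    for j, fj in f.items():
        for l, gl in g.items():
            product[j + l] += fj * gl
    # Drop zero coefficients and return the terms in increasing exponent order.
    return {n: c for n, c in sorted(product.items()) if c != 0}


f = {-2: 1, 0: 3}   # T^{-2} + 3
g = {-1: 2, 1: -1}  # 2 T^{-1} - T
fg = laurent_multiply(f, g)
print(fg)           # 2 T^{-3} + 5 T^{-1} - 3 T
print(min(fg) == min(f) + min(g))  # the smallest exponents add
```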

It is not difficult to check that $k_0((T))$ is a commutative ring with respect
to this definition of multiplication, and in fact a commutative algebra over $k_0$.
Let us identify each $f(T) \in k_0[[T]]$ with an element of $k_0((T))$, by putting
$f_j = 0$ when $j < 0$. This makes $k_0[[T]]$ a subalgebra of $k_0((T))$, and thus $k_0$ and
$k_0[T]$ can be identified with subalgebras of $k_0((T))$ as well. In particular, the
multiplicative identity element $1$ in $k_0$ corresponds to the multiplicative identity
element in $k_0((T))$.

If $f(T) \in k_0((T))$ and $f(T) \neq 0$, then $f(T)$ can be expressed as

(3.51)    $f(T) = c\,T^n\,(1 - T\,b(T))$

for some $c \in k_0$ with $c \neq 0$, $n \in \mathbf{Z}$, and $b(T) \in k_0[[T]]$. Remember that $1 - T\,b(T)$
has a multiplicative inverse in $k_0[[T]]$, as in the previous section. This implies
that $f(T)$ has a multiplicative inverse in $k_0((T))$, given by

(3.52)    $f(T)^{-1} = c^{-1}\,T^{-n}\,(1 - T\,b(T))^{-1}$,

so that $k_0((T))$ is a field.

Let $n(f(T))$ be the unique integer $n$ as in (3.51) when $f(T) \in k_0((T))$ and
$f(T) \neq 0$, which is the same as saying that $f_n \neq 0$ and $f_j = 0$ for every $j < n$.
Let us also put $n(0) = +\infty$, so that this definition of $n(f(T))$ extends the one
for $f(T) \in k_0[[T]]$ in Section 3.2. It is easy to see that (3.20), (3.21), and
(3.22) continue to hold for $f(T), g(T) \in k_0((T))$, with the usual interpretations
for infinite values of $n(\cdot)$. Of course, $k_0[[T]]$ corresponds exactly to the set of
$f(T) \in k_0((T))$ such that $n(f(T)) \ge 0$.

As in Section 3.2 again, we let $r$ be a positive real number strictly less than
$1$, and put

(3.53)    $|f(T)| = |f(T)|_r = r^{n(f(T))}$

when $f(T) \in k_0((T))$ and $f(T) \neq 0$, and $|0| = 0$. This extension of $|f(T)|$ to
$f(T) \in k_0((T))$ continues to satisfy (3.24), (3.25), and (3.26), by the analogues of
(3.20), (3.21), and (3.22) for $f(T), g(T) \in k_0((T))$. It follows that $|f(T)|$ defines
an ultrametric absolute value function on the field $k_0((T))$, whose restriction to
$k_0$ is the trivial absolute value function on $k_0$. Note that (3.28) also continues
to hold for every $a > 0$ and $f(T) \in k_0((T))$.

Put

(3.54)    $T^n\, k_0[[T]] = \{T^n f(T) : f(T) \in k_0[[T]]\} = \{g(T) \in k_0((T)) : |g(T)| \le r^n\}$

for each $n \in \mathbf{Z}$. If $k_0$ has only finitely many elements, then we have seen that
$k_0[[T]]$ is compact with respect to the topology determined by the ultrametric
associated to $|f(T)|$, as in Section 3.2. Similarly, (3.54) is compact for each
$n \in \mathbf{Z}$ in this case. This implies that closed and bounded subsets of $k_0((T))$
are compact, because every bounded subset of $k_0((T))$ is contained in (3.54) for
some $n \in \mathbf{Z}$.

Let $k_0$ be any field again, and let $\{f_l(T)\}_{l=1}^{\infty}$ be a sequence of elements of
$k_0((T))$. Thus for each $l \ge 1$, $f_l(T)$ may be expressed as

(3.55)    $f_l(T) = \sum_{j \gg -\infty} f_{j,l}\, T^j$,

where $f_{j,l} \in k_0$. This sequence is bounded in $k_0((T))$ with respect to the absolute
value function $|\cdot|$ defined earlier if and only if there is an $n \in \mathbf{Z}$ such that $f_l(T)$
is an element of (3.54) for each $l \ge 1$. Equivalently, this means that there is an
$n \in \mathbf{Z}$ such that $f_l(T)$ can be expressed as

(3.56)    $f_l(T) = \sum_{j=n}^{\infty} f_{j,l}\, T^j$

for each $l \ge 1$.

Let $\{f_l(T)\}_{l=1}^{\infty}$ be a sequence of elements of $k_0((T))$ again, as in (3.55),
and let $f(T)$ be another element of $k_0((T))$, as in (3.46). One can check that
$\{f_l(T)\}_{l=1}^{\infty}$ converges to $f(T)$ with respect to the ultrametric associated to the
absolute value function defined earlier if and only if $\{f_l(T)\}_{l=1}^{\infty}$ is a bounded
sequence in $k_0((T))$, and for each $j \in \mathbf{Z}$ we have that

(3.57)    $f_{j,l} = f_j$

for all sufficiently large $l$, depending on $j$. Similarly, $\{f_l(T)\}_{l=1}^{\infty}$ is a Cauchy
sequence in $k_0((T))$ with respect to the ultrametric associated to the absolute
value function defined earlier if and only if $\{f_l(T)\}_{l=1}^{\infty}$ is a bounded sequence in
$k_0((T))$, and for each $j \in \mathbf{Z}$, $f_{j,l}$ is constant in $l$ for sufficiently large $l$, depending
on $j$. In particular, it follows from this that every Cauchy sequence in $k_0((T))$
converges to an element of $k_0((T))$, so that $k_0((T))$ is complete with respect to
the ultrametric associated to the absolute value function defined earlier.


3.5 p-Adic integers

Let $p$ be a prime number, and let $|\cdot|_p$ be the $p$-adic absolute value on $\mathbf{Q}$, as in
Section 1.3. Thus every integer $x$ satisfies

(3.58)    $|x|_p \le 1$,

which implies that (3.58) also holds for every $x \in \mathbf{Q}$ in the closure of the set $\mathbf{Z}$
of integers with respect to the $p$-adic metric. Conversely, suppose that $y \in \mathbf{Q}$
satisfies $|y|_p \le 1$, and let us check that $y$ is in the closure of $\mathbf{Z}$ in $\mathbf{Q}$ with respect
to the $p$-adic metric. By definition of the $p$-adic absolute value, $y = a/b$ for
some $a, b \in \mathbf{Z}$ such that $b \neq 0$ and $b$ is not an integer multiple of $p$. It follows
that there are $c, w \in \mathbf{Z}$ such that

(3.59)    $b\,c = 1 - p\,w$,

because the integers modulo $p$ form a field, and hence

(3.60)    $y = \frac{a}{b} = \frac{a\,c}{b\,c} = \frac{a\,c}{1 - p\,w}$.

We have seen that $1/(1 - p\,w)$ can be expressed as the limit of a sequence of
integers with respect to the $p$-adic metric, as in (3.36) in Section 3.3. This
implies that $y$ has the same property, as desired.
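The argument above is constructive, and can be followed step by step in a short Python sketch. The helper names `vp` and `integer_approximation` are introduced for this illustration only, and `pow(b, -1, p)` (available in Python 3.8 and later) is used to find $c$ with $b\,c \equiv 1$ modulo $p$. The integer $y_n = a\,c \sum_{j=0}^{n} (p\,w)^j$ then satisfies $|a/b - y_n|_p \le p^{-(n+1)}$, by (3.33) and (3.60).

```python
from fractions import Fraction


def vp(x, p):
    """p-adic valuation of a nonzero rational x, so that |x|_p = p**(-vp(x, p))."""
    x = Fraction(x)
    if x == 0:
        raise ValueError("the valuation of 0 is +infinity")
    numerator, denominator, v = x.numerator, x.denominator, 0
    while numerator % p == 0:
        numerator //= p
        v += 1
    while denominator % p == 0:
        denominator //= p
        v -= 1
    return v


def integer_approximation(a, b, p, n):
    """Integer y_n with |a/b - y_n|_p <= p**(-(n + 1)), assuming p does not divide b."""
    if b % p == 0:
        raise ValueError("b must not be a multiple of p")
    c = pow(b, -1, p)     # b * c = 1 modulo p
    w = (1 - b * c) // p  # exact division, so that b * c = 1 - p * w as in (3.59)
    return a * c * sum((p * w) ** j for j in range(n + 1))


p, a, b = 5, 2, 3
for n in range(5):
    y_n = integer_approximation(a, b, p, n)
    print(n, y_n, vp(Fraction(a, b) - y_n, p))
```

The last column is the $p$-adic valuation of the error, which should be at least $n + 1$ in each row.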

The set $\mathbf{Z}_p$ of $p$-adic integers is defined to be the closed unit ball in $\mathbf{Q}_p$ with
respect to the $p$-adic metric, which is to say that

(3.61)    $\mathbf{Z}_p = \{x \in \mathbf{Q}_p : |x|_p \le 1\}$.

Clearly $\mathbf{Z} \subseteq \mathbf{Z}_p$, which implies that $\mathbf{Z}_p$ contains the closure of $\mathbf{Z}$ in $\mathbf{Q}_p$ with
respect to the $p$-adic metric. In fact, one can check that $\mathbf{Z}_p$ is equal to the closure
of $\mathbf{Z}$ in $\mathbf{Q}_p$, using the remarks in the preceding paragraph. More precisely, one
can first verify that $\mathbf{Q} \cap \mathbf{Z}_p$ is dense in $\mathbf{Z}_p$, because $\mathbf{Q}$ is dense in $\mathbf{Q}_p$, and using
the ultrametric version of the triangle inequality. The discussion in the previous
paragraph implies that $\mathbf{Z}$ is dense in $\mathbf{Q} \cap \mathbf{Z}_p$, and hence that $\mathbf{Z}$ is dense in $\mathbf{Z}_p$,
as desired.

Put

(3.62)    $p^j\,\mathbf{Z} = \{p^j x : x \in \mathbf{Z}\}$

and

(3.63)    $p^j\,\mathbf{Z}_p = \{p^j x : x \in \mathbf{Z}_p\} = \{y \in \mathbf{Q}_p : |y|_p \le p^{-j}\}$

for each $j \in \mathbf{Z}$. It is easy to see that $p^j\,\mathbf{Z}_p$ is the closure of $p^j\,\mathbf{Z}$ in $\mathbf{Q}_p$ for each
$j \in \mathbf{Z}$, because of the statement for $j = 0$ discussed in the preceding paragraph.
Note that $p^j\,\mathbf{Z}$ is a subgroup of $\mathbf{Q}$ with respect to addition for each $j \in \mathbf{Z}$, and
similarly $p^j\,\mathbf{Z}_p$ is a subgroup of $\mathbf{Q}_p$ with respect to addition for each $j \in \mathbf{Z}$. Of
course, $\mathbf{Z}$ is a subring of $\mathbf{Q}$, and one can check that $\mathbf{Z}_p$ is a subring of $\mathbf{Q}_p$ too.
If $j$ is a nonnegative integer, then $p^j\,\mathbf{Z}$ is an ideal in $\mathbf{Z}$, and $p^j\,\mathbf{Z}_p$ is an ideal in
$\mathbf{Z}_p$. This implies that the quotients

(3.64)    $\mathbf{Z} / p^j\,\mathbf{Z}$

and

(3.65)    $\mathbf{Z}_p / p^j\,\mathbf{Z}_p$

are defined as commutative rings when $j \ge 0$. The obvious inclusion of $\mathbf{Z}$ in
$\mathbf{Z}_p$ leads to a ring homomorphism from (3.64) into (3.65) for each nonnegative
integer $j$, since $p^j\,\mathbf{Z}$ is contained in $p^j\,\mathbf{Z}_p$. This homomorphism is actually
injective for each $j \ge 0$, because

(3.66)    $\mathbf{Z} \cap (p^j\,\mathbf{Z}_p) = p^j\,\mathbf{Z}$

for every nonnegative integer $j$, as one can verify directly from the definitions.
One can also check that this homomorphism from (3.64) into (3.65) is surjective
for each $j \ge 0$, using the fact that $\mathbf{Z}$ is dense in $\mathbf{Z}_p$. Thus this homomorphism
from (3.64) into (3.65) is an isomorphism for each $j \ge 0$.

It follows that (3.65) has exactly $p^j$ elements for every nonnegative integer
$j$, so that $\mathbf{Z}_p$ can be expressed as the union of $p^j$ pairwise-disjoint closed balls of
radius $p^{-j}$ in $\mathbf{Q}_p$ for each $j \ge 0$. This implies that $\mathbf{Z}_p$ is compact with respect
to the $p$-adic metric, because $\mathbf{Z}_p$ is closed and totally bounded in $\mathbf{Q}_p$, and $\mathbf{Q}_p$ is
complete. Similarly, $p^l\,\mathbf{Z}_p$ is a compact subset of $\mathbf{Q}_p$ for every $l \in \mathbf{Z}$. Of course,
every bounded subset of $\mathbf{Q}_p$ is contained in $p^l\,\mathbf{Z}_p$ for some $l \in \mathbf{Z}$, and hence
every closed and bounded subset of $\mathbf{Q}_p$ is compact.


3.6 Radius of convergence 

Let $k$ be a field with an absolute value function $|\cdot|$, and suppose that $k$ is
complete with respect to the metric associated to $|\cdot|$. Also let $a_0, a_1, a_2, a_3, \ldots$
be a sequence of elements of $k$, and consider the corresponding formal power
series

(3.67)    $f(X) = \sum_{j=0}^{\infty} a_j\, X^j$,

where $X$ is an indeterminate. As in [2, 12], we use upper-case letters like $X$ for
indeterminates, and lower-case letters like $x$ for elements of $k$ or other fields. If
$x \in k$, then we can consider the convergence of the power series

(3.68)    $\sum_{j=0}^{\infty} a_j\, x^j$,

where $x^0$ is interpreted as being the multiplicative identity element $1$ in $k$, as
usual. Of course, if (3.68) converges for some $x \in k$, then

(3.69)    $\lim_{j \to \infty} a_j\, x^j = 0$

in $k$, and hence

(3.70)    $\{a_j\, x^j\}_{j=0}^{\infty}$ is a bounded sequence

in $k$.

If

(3.71)    $\{|a_j|\, t^j\}_{j=0}^{\infty}$ is a bounded sequence

in $\mathbf{R}$ for some nonnegative real number $t$, then

(3.72)    $\lim_{j \to \infty} |a_j|\, r^j = 0$

for every nonnegative real number $r < t$, and in fact

(3.73)    $\sum_{j=0}^{\infty} |a_j|\, r^j$

converges in $\mathbf{R}$ when $0 \le r < t$. Let $\rho$ be the supremum of the set of $r \ge 0$
such that (3.72) holds, which automatically includes $r = 0$. As usual, $\rho$ is taken
to be $+\infty$ when (3.72) holds for arbitrarily large $r$. Equivalently, $\rho$ may be
defined as the supremum of the set of $t \ge 0$ such that (3.71) holds, or as the
supremum of the $r \ge 0$ such that (3.73) converges. We can also characterize
$\rho$ as the unique nonnegative extended real number such that (3.73) converges
when $0 \le r < \rho$, and (3.71) does not hold for any $t > \rho$. It follows that (3.68)
converges absolutely for each $x \in k$ with $|x| < \rho$, and that (3.68) does not
converge for any $x \in k$ with $|x| > \rho$. Of course, $\rho$ is known as the radius of
convergence of the formal power series (3.67). It is well known that

(3.74)    $\rho = \Big( \limsup_{j \to \infty} |a_j|^{1/j} \Big)^{-1}$,

with the usual conventions that $1/0 = +\infty$ and $1/(+\infty) = 0$.
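The formula (3.74) can be explored numerically, with the caveat that a limit superior cannot be computed from finitely many coefficients. The following Python sketch replaces it by the maximum of $|a_j|^{1/j}$ over a finite tail of indices, which is only a heuristic. The names `root_test_terms` and `radius_estimate` and the parameter `tail_start` are assumptions made for this illustration, and the absolute value is the standard one on $\mathbf{Q}$.

```python
import math
from fractions import Fraction


def root_test_terms(coeffs):
    """Return the pairs (j, |a_j|**(1/j)) for j >= 1 with a_j != 0.

    Logarithms of numerator and denominator are used so that very large
    or very small rational coefficients do not overflow a float."""
    terms = []
    for j, a in enumerate(coeffs):
        if j == 0 or a == 0:
            continue
        a = abs(Fraction(a))
        log_abs = math.log(a.numerator) - math.log(a.denominator)
        terms.append((j, math.exp(log_abs / j)))
    return terms


def radius_estimate(coeffs, tail_start):
    """Heuristic stand-in for (3.74): 1 / max of |a_j|**(1/j) over j >= tail_start."""
    tail = [value for j, value in root_test_terms(coeffs) if j >= tail_start]
    return 1.0 / max(tail)


# a_j = 2^j / (j + 1), whose radius of convergence is 1/2.
coeffs = [Fraction(2 ** j, j + 1) for j in range(401)]
print(radius_estimate(coeffs, 200))
```

With 401 coefficients the estimate is within about $0.01$ of $1/2$; the slow convergence reflects the factor $(j + 1)^{1/j}$, which tends to $1$ only gradually.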





Let us suppose for the rest of the section that either

(3.75)    $k = \mathbf{R}$ or $\mathbf{C}$, with the standard absolute value function,

or that

(3.76)    $|\cdot|$ is an ultrametric absolute value function on a field $k$,
          and $k$ is complete with respect to the associated ultrametric.

If $k = \mathbf{R}$ or $\mathbf{C}$, and (3.73) converges for some $r > 0$, then (3.68) converges
absolutely for every $x \in k$ with $|x| \le r$, and the sequence of partial sums

(3.77)    $\sum_{j=0}^{n} a_j\, x^j$

converges uniformly to the sum (3.68) on the closed ball

(3.78)    $\overline{B}(0, r) = \{x \in k : |x| \le r\}$.

Similarly, if $k$ is as in (3.76), and (3.72) holds for some $r > 0$, then (3.68)
converges in $k$ for every $x \in k$ with $|x| \le r$, and the partial sums (3.77) converge
to the whole sum (3.68) uniformly on $\overline{B}(0, r)$. In both cases, it follows that (3.68)
defines a continuous $k$-valued function on $\overline{B}(0, r)$. Using this, one can check that
(3.68) defines a continuous $k$-valued function on the open ball

(3.79)    $B(0, \rho) = \{x \in k : |x| < \rho\}$,

where $\rho$ is the radius of convergence of (3.67), as in the preceding paragraph.

Let

(3.80)    $\sum_{l=0}^{\infty} b_l\, x^l$

be another power series with coefficients in $k$, and let

(3.81)    $c_n = \sum_{j=0}^{n} a_j\, b_{n-j}$

be the Cauchy product of the coefficients of (3.68) and (3.80) for each $n \ge 0$, as
in (3.1) in Section 3.1. Thus

(3.82)    $c_n\, x^n = \sum_{j=0}^{n} (a_j\, x^j)\,(b_{n-j}\, x^{n-j})$

for each $n \ge 0$, so that

(3.83)    $\sum_{n=0}^{\infty} c_n\, x^n$

is the same as the Cauchy product of (3.68) and (3.80). If $k = \mathbf{R}$ or $\mathbf{C}$, and
(3.68) and (3.80) converge absolutely for some $x \in k$, then it follows that (3.83)
also converges absolutely, and satisfies

(3.84)    $\sum_{n=0}^{\infty} c_n\, x^n = \Big(\sum_{j=0}^{\infty} a_j\, x^j\Big) \Big(\sum_{l=0}^{\infty} b_l\, x^l\Big)$,

as in Section 3.1. Similarly, if $k$ is as in (3.76), and (3.68) and (3.80) converge
in $k$ for some $x \in k$, then (3.83) converges in $k$ too, and satisfies (3.84).
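For coefficient sequences with only finitely many nonzero terms, (3.81) and (3.84) can be checked exactly. The following Python sketch uses the hypothetical helper names `cauchy_product` and `evaluate`, and pads the shorter examples with zeros so that the truncated product contains every nonzero coefficient.

```python
def cauchy_product(a, b):
    """Return c_n = sum_{j=0}^n a_j b_{n-j} for n < min(len(a), len(b)), as in (3.81)."""
    size = min(len(a), len(b))
    return [sum(a[j] * b[n - j] for j in range(n + 1)) for n in range(size)]


def evaluate(coeffs, x):
    """Finite sum of coeffs[j] * x**j."""
    return sum(c * x ** j for j, c in enumerate(coeffs))


# (1 + x)(1 - x) = 1 - x^2, with zero padding so that nothing is truncated.
a = [1, 1, 0, 0]
b = [1, -1, 0, 0]
c = cauchy_product(a, b)
print(c)
print(evaluate(c, 3) == evaluate(a, 3) * evaluate(b, 3))  # (3.84) for polynomials

# The first coefficients of the square of the geometric series are 1, 2, 3, ...
print(cauchy_product([1] * 8, [1] * 8))
```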

3.7 Compositions 

Let $k$ be a field, and let

(3.85)    $f(x) = \sum_{j=0}^{\infty} a_j\, x^j$

and

(3.86)    $g(y) = \sum_{l=0}^{\infty} b_l\, y^l$

be power series with coefficients in $k$. We would like to consider the composition

(3.87)    $f(g(y)) = \sum_{j=0}^{\infty} a_j\, g(y)^j$

of these two series, at least formally for the moment. Put

(3.88)    $E_j = (\mathbf{Z}_+ \cup \{0\})^j$

for each $j \in \mathbf{Z}_+$, which is the $j$th Cartesian power of $\mathbf{Z}_+ \cup \{0\}$, consisting of
$j$-tuples $\alpha = (\alpha_1, \ldots, \alpha_j)$ of nonnegative integers. Also put

(3.89)    $d_j(\alpha) = \alpha_1 + \alpha_2 + \cdots + \alpha_j$

and

(3.90)    $\beta_j(\alpha) = b_{\alpha_1}\, b_{\alpha_2} \cdots b_{\alpha_j}$

for each $\alpha \in E_j$. Thus

(3.91)    $g(y)^j = \sum_{\alpha \in E_j} \beta_j(\alpha)\, y^{d_j(\alpha)}$

for each $j \in \mathbf{Z}_+$, at least formally, and in particular this holds for every $y \in k$
when $b_l = 0$ for all but finitely many $l$, so that the sum on the right side of
(3.91) reduces to a finite sum.

It will be convenient to take $E_0$ to be a set with exactly one element not in
$E_j$ for any $j \in \mathbf{Z}_+$, so that the $E_j$'s are pairwise disjoint for all $j \ge 0$. Put

(3.92)    $E = \bigcup_{j=0}^{\infty} E_j$,

and let $\phi$ be the $k$-valued function on $E$ defined by

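(3.93)    $\phi(\alpha) = a_j\, \beta_j(\alpha)$

for each $\alpha \in E_j$ when $j \ge 1$, and $\phi = a_0$ on $E_0$. Similarly, let $d$ be the function
on $E$ with values in $\mathbf{Z}_+ \cup \{0\}$ defined by

(3.94)    $d = d_j$ on $E_j$

for each $j \ge 0$, with $d_0 = 0$ on $E_0$. Combining (3.87) and (3.91), we get that

(3.95)    $f(g(y)) = a_0 + \sum_{j=1}^{\infty} \Big( \sum_{\alpha \in E_j} a_j\, \beta_j(\alpha)\, y^{d_j(\alpha)} \Big) = \sum_{\alpha \in E} \phi(\alpha)\, y^{d(\alpha)}$,

at least formally. In particular, if $a_j = 0$ for all but finitely many $j$, and $b_l = 0$
for all but finitely many $l$, then $\phi \in c_{00}(E, k)$, and (3.95) holds for every $y \in k$.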
Put

(3.96)    $A_n = \{\alpha \in E : d(\alpha) = n\} = \bigcup_{j=0}^{\infty} \{\alpha \in E_j : d_j(\alpha) = n\}$

for each nonnegative integer $n$, so that the $A_n$'s are pairwise disjoint and

(3.97)    $E = \bigcup_{n=0}^{\infty} A_n$.

If we put

(3.98)    $c_n = \sum_{\alpha \in A_n} \phi(\alpha)$

for each $n \ge 0$, then we get that

(3.99)    $f(g(y)) = \sum_{n=0}^{\infty} \Big( \sum_{\alpha \in A_n} \phi(\alpha) \Big)\, y^n = \sum_{n=0}^{\infty} c_n\, y^n$,

at least formally, by (3.95). As before, if $a_j = 0$ for all but finitely many $j$, and
$b_l = 0$ for all but finitely many $l$, then $\phi \in c_{00}(E, k)$, and (3.99) holds for all
$y \in k$.

Suppose for the moment that $k = \mathbf{R}$ or $\mathbf{C}$, with the standard absolute value
function, and that

(3.100)    $\sum_{j=0}^{\infty} |a_j|\, r^j$

converges for some $r > 0$. Suppose also that

(3.101)    $\sum_{l=0}^{\infty} |b_l|\, t^l \le r$

for some $t > 0$, so that for each $y \in k$ with $|y| \le t$ the series in (3.86) converges
absolutely and satisfies

(3.102)    $|g(y)| \le r$.

It follows that the series in (3.87) converges absolutely when $|y| \le t$ too. Observe
that

(3.103)    $\sum_{\alpha \in E_j} |\beta_j(\alpha)|\, t^{d_j(\alpha)} = \Big( \sum_{l=0}^{\infty} |b_l|\, t^l \Big)^j \le r^j$

for each $j \ge 1$, and hence that

(3.104)    $\sum_{\alpha \in E} |\phi(\alpha)|\, t^{d(\alpha)} = |a_0| + \sum_{j=1}^{\infty} \sum_{\alpha \in E_j} |a_j|\, |\beta_j(\alpha)|\, t^{d_j(\alpha)} \le \sum_{j=0}^{\infty} |a_j|\, r^j$.

If $y \in k$ and $|y| \le t$, then we get that

(3.105)    $\beta_j(\alpha)\, y^{d_j(\alpha)}$ is summable on $E_j$

for each $j \ge 1$, and that

(3.106)    $\phi(\alpha)\, y^{d(\alpha)}$ is summable on $E$.

Using (3.105), it is easy to see that (3.91) holds for each $j$, as before. More
precisely, the sum on the right side of (3.91) may be treated as an iterated
sum over each of the $j$ factors of $\mathbf{Z}_+ \cup \{0\}$ in $E_j$, which can be evaluated using
(3.86) in each coordinate. Similarly, (3.95) holds under these conditions, by the
remarks in Section 2.9.

Using (3.97), we can rearrange the sum in (3.104), to get that

(3.107)    $\sum_{n=0}^{\infty} \Big( \sum_{\alpha \in A_n} |\phi(\alpha)|\, t^{d(\alpha)} \Big) = \sum_{\alpha \in E} |\phi(\alpha)|\, t^{d(\alpha)} \le \sum_{j=0}^{\infty} |a_j|\, r^j$.

Equivalently, this means that

(3.108)    $\sum_{n=0}^{\infty} \Big( \sum_{\alpha \in A_n} |\phi(\alpha)| \Big)\, t^n \le \sum_{j=0}^{\infty} |a_j|\, r^j$,

by the definition (3.96) of $A_n$. Because $t > 0$, it follows that

(3.109)    $\phi(\alpha)$ is summable on $A_n$

for each $n \ge 0$, so that the sum in (3.98) is defined for each $n \ge 0$. If $|y| \le t$,
then (3.106) permits us to go from (3.95) to (3.99), as in Section 2.9. More
precisely, the sum on the right side of (3.99) converges absolutely when $|y| \le t$,
and the value of the sum is equal to $f(g(y))$. Note that

(3.110)    $|c_n| \le \sum_{\alpha \in A_n} |\phi(\alpha)|$

for each $n \ge 0$, by the definition (3.98) of $c_n$. Thus (3.108) implies that

(3.111)    $\sum_{n=0}^{\infty} |c_n|\, t^n \le \sum_{j=0}^{\infty} |a_j|\, r^j$,

and in particular that the left side of (3.111) converges under these conditions.

Now let $k$ be any field with an ultrametric absolute value function $|\cdot|$ such
that $k$ is complete with respect to the ultrametric associated to $|\cdot|$. Suppose
that

(3.112)    $\lim_{j \to \infty} |a_j|\, r^j = 0$

for some $r > 0$, and that $t > 0$ satisfies

(3.113)    $\lim_{l \to \infty} |b_l|\, t^l = 0$

and

(3.114)    $\max_{l \ge 0} |b_l|\, t^l \le r$.

If $y \in k$ and $|y| \le t$, then (3.113) implies that the series in (3.86) defining $g(y)$
converges in $k$, and (3.114) implies that

(3.115)    $|g(y)| \le r$.

It follows that the series in (3.87) converges in $k$ as well under these conditions,
by (3.112). Using (3.113), one can check that

(3.116)    $|\beta_j(\alpha)|\, t^{d_j(\alpha)}$ vanishes at infinity on $E_j$

for each $j \ge 1$. Moreover,

(3.117)    $\max_{\alpha \in E_j} |\beta_j(\alpha)|\, t^{d_j(\alpha)} = \Big( \max_{l \ge 0} (|b_l|\, t^l) \Big)^j \le r^j$

for each $j \ge 1$, by (3.114). Thus

(3.118)    $\max_{\alpha \in E_j} |\phi(\alpha)|\, t^{d(\alpha)} = |a_j| \max_{\alpha \in E_j} |\beta_j(\alpha)|\, t^{d_j(\alpha)} \le |a_j|\, r^j$

for each $j \ge 1$, which tends to $0$ as $j \to \infty$, by (3.112). This implies that

(3.119)    $|\phi(\alpha)|\, t^{d(\alpha)}$ vanishes at infinity on $E$,

using (3.116) to get that the restriction of $|\phi(\alpha)|\, t^{d(\alpha)}$ to $E_j$ vanishes at infinity
on $E_j$ for each $j \ge 1$. If $y \in k$ and $|y| \le t$, then we obtain that

(3.120)    $\beta_j(\alpha)\, y^{d_j(\alpha)}$ vanishes at infinity on $E_j$

for each $j \ge 1$, by (3.116), and that

(3.121)    $\phi(\alpha)\, y^{d(\alpha)}$ vanishes at infinity on $E$,

by (3.119). Hence

(3.122)    $\sum_{\alpha \in E_j} \beta_j(\alpha)\, y^{d_j(\alpha)}$ satisfies the generalized Cauchy criterion

for each $j \ge 1$, as in Section 2.6, and similarly

(3.123)    $\sum_{\alpha \in E} \phi(\alpha)\, y^{d(\alpha)}$ satisfies the generalized Cauchy criterion.

This means that these sums can be defined as elements of $k$, as in Section 2.7,
because $k$ is complete. As before, (3.91) and (3.95) hold under these conditions,
by the remarks in Section 2.9.

Of course, (3.119) implies that the restriction of $|\phi(\alpha)|\, t^{d(\alpha)}$ to $\alpha \in A_n$ vanishes
at infinity on $A_n$ for each $n \ge 0$. This implies that

(3.124)    $\phi(\alpha)$ vanishes at infinity on $A_n$

for each $n \ge 0$, since $d(\alpha) = n$ for every $\alpha \in A_n$, by the definition (3.96) of $A_n$,
and $t > 0$. It follows that

(3.125)    $\sum_{\alpha \in A_n} \phi(\alpha)$ satisfies the generalized Cauchy criterion

for each $n \ge 0$, as in Section 2.7, so that (3.98) is well defined for each $n \ge 0$.
If $y \in k$ and $|y| \le t$, then (3.123) permits us to go from (3.95) to (3.99) again,
as in Section 2.9. More precisely, this means that the sum on the right side of
(3.99) converges in $k$ when $|y| \le t$, and that the value of the sum is equal to
$f(g(y))$. Note that

(3.126)    $|c_n| \le \max_{\alpha \in A_n} |\phi(\alpha)|$

for each $n \ge 0$, by the definition (3.98) of $c_n$ and the ultrametric version of the
triangle inequality. Thus

(3.127)    $|c_n|\, t^n \le \max_{\alpha \in A_n} |\phi(\alpha)|\, t^n = \max_{\alpha \in A_n} |\phi(\alpha)|\, t^{d(\alpha)}$

for each $n \ge 0$, using the definition (3.96) of $A_n$ in the second step. In particular,

(3.128)    $\lim_{n \to \infty} |c_n|\, t^n = 0$,

because of (3.119), and because the $A_n$'s are pairwise-disjoint subsets of $E$.

3.8 Compositions, continued 

Let $k$ be a field, and let

(3.129)    $f(X) = \sum_{j=0}^{\infty} a_j\, X^j$

and

(3.130)    $g(Y) = \sum_{l=0}^{\infty} b_l\, Y^l$

be formal power series with coefficients in $k$. As in the previous section, we
would like to consider the composition

(3.131)    $f(g(Y)) = \sum_{j=0}^{\infty} a_j\, g(Y)^j$

of these two series, at least formally. If $E_j$, $d_j(\alpha)$, and $\beta_j(\alpha)$ are as in (3.88),
(3.89), and (3.90), respectively, then we have that

(3.132)    $g(Y)^j = \sum_{\alpha \in E_j} \beta_j(\alpha)\, Y^{d_j(\alpha)}$

for each $j \in \mathbf{Z}_+$, as in (3.91). More precisely, put

(3.133)    $A_{j,n} = \{\alpha \in E_j : d_j(\alpha) = n\}$

for each $j \in \mathbf{Z}_+$ and nonnegative integer $n$, so that the $A_{j,n}$'s are pairwise-disjoint
finite subsets of $E_j$ such that

(3.134)    $E_j = \bigcup_{n=0}^{\infty} A_{j,n}$.

Thus

(3.135)    $c_{j,n} = \sum_{\alpha \in A_{j,n}} \beta_j(\alpha)$

is defined as a finite sum of elements of $k$ for each $j \in \mathbf{Z}_+$ and $n \ge 0$, and (3.132)
may be interpreted as saying that

(3.136)    $g(Y)^j = \sum_{n=0}^{\infty} c_{j,n}\, Y^n$

for each $j \in \mathbf{Z}_+$, as formal power series in $Y$.

If $E$, $\phi$, and $d$ are as in (3.92), (3.93), and (3.94), respectively, then we get
that

(3.137)    $f(g(Y)) = a_0 + \sum_{j=1}^{\infty} \sum_{\alpha \in E_j} a_j\, \beta_j(\alpha)\, Y^{d_j(\alpha)} = \sum_{\alpha \in E} \phi(\alpha)\, Y^{d(\alpha)}$,

at least formally, as in (3.95). Similarly, if $A_n$ and $c_n$ are as in (3.96) and (3.98),
respectively, then we get that

(3.138)    $f(g(Y)) = \sum_{n=0}^{\infty} \Big( \sum_{\alpha \in A_n} \phi(\alpha) \Big)\, Y^n = \sum_{n=0}^{\infty} c_n\, Y^n$,

at least formally, as in (3.99). More precisely, note that

(3.139)    $A_n = \bigcup_{j=1}^{\infty} A_{j,n}$

when $n \ge 1$, and that

(3.140)    $A_0 = E_0 \cup \Big( \bigcup_{j=1}^{\infty} A_{j,0} \Big)$,

where $E_0$ is as in the previous section. This implies that

(3.141)    $c_n = \sum_{j=1}^{\infty} a_j\, c_{j,n}$

when $n \ge 1$, and that

(3.142)    $c_0 = a_0 + \sum_{j=1}^{\infty} a_j\, c_{j,0}$,

at least formally. Thus (3.138) is basically the same as saying that

(3.143)    $f(g(Y)) = a_0 + \sum_{j=1}^{\infty} a_j\, g(Y)^j = a_0 + \sum_{j=1}^{\infty} \sum_{n=0}^{\infty} a_j\, c_{j,n}\, Y^n = \sum_{n=0}^{\infty} c_n\, Y^n$,

at least formally again, using (3.136) in the second step, and interchanging the
order of summation in the third step. If $a_j = 0$ for all but finitely many $j$, so
that $f(X)$ is actually a formal polynomial in $X$, then (3.131) reduces to a finite
sum of products of formal power series, as in Section 3.2. In this case, (3.141)
and (3.142) reduce to finite sums in $k$, and there is no problem with (3.143).
Observe that

(3.144)    $c_{j,0} = b_0^j$

for each $j \ge 1$, so that (3.142) becomes

(3.145)    $c_0 = \sum_{j=0}^{\infty} a_j\, b_0^j$.

As in (3.143), this is the expected constant term in $f(g(Y))$, at least formally.
If $a_j \neq 0$ for infinitely many $j$, and $b_0 \neq 0$, then one would normally need some
additional convergence hypotheses to make sense of (3.145). Similarly, $c_n$ may
involve sums of infinitely many nonzero terms when $a_j \neq 0$ for infinitely many
$j$, $b_0 \neq 0$, and $n \ge 1$. However, if $b_0 = 0$, then it is easy to see that (3.131)
makes sense as a formal power series in $Y$. In this case,

(3.146)    $\beta_j(\alpha) = 0$

for every $\alpha \in E_j$ such that $d_j(\alpha) < j$, because at least one of the coordinates of
$\alpha$ has to be equal to $0$. This implies that

(3.147)    $c_{j,n} = 0$

when $j > n$, so that the sums in (3.141) and (3.142) have only finitely many
nonzero terms. It is a bit simpler to take

(3.148)    $E_j = \mathbf{Z}_+^j$

in this situation, instead of (3.88), which amounts to throwing away the terms
that are automatically equal to $0$ when $b_0 = 0$. With this definition of $E_j$, we
have that $d_j(\alpha) \ge j$ on $E_j$, $A_{j,n} = \emptyset$ when $j > n$, and that $A_n$ has only finitely
many elements for each $n \ge 0$.
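When $b_0 = 0$, the finiteness of the sums in (3.141) and (3.142) means that the coefficients $c_n$ can be computed exactly from finitely many $a_j$ and $b_l$. Here is a Python sketch along these lines, which obtains the $c_{j,n}$ as the coefficients of the successive powers $g(Y)^j$ modulo $Y^N$, as in (3.136), rather than by enumerating the sets $A_{j,n}$. The names `multiply` and `compose` and the truncation length `size` are assumptions made for this example.

```python
def multiply(f, g, size):
    """Product of the coefficient lists f and g, truncated modulo Y**size."""
    out = [0] * size
    for j, fj in enumerate(f[:size]):
        for l, gl in enumerate(g[:size - j]):
            out[j + l] += fj * gl
    return out


def compose(a, b, size):
    """Coefficients c_0, ..., c_{size-1} of f(g(Y)), assuming b[0] == 0.

    Uses c_n = sum_j a_j c_{j,n}, where c_{j,n} is the coefficient of Y^n
    in g(Y)^j. Since c_{j,n} = 0 when j > n, as in (3.147), only the a_j
    with j < size are needed."""
    if b[0] != 0:
        raise ValueError("this sketch requires b_0 = 0")
    c = [0] * size
    power = [1] + [0] * (size - 1)  # coefficients of g(Y)^0 = 1
    for j in range(min(len(a), size)):
        for n in range(size):
            c[n] += a[j] * power[n]
        power = multiply(power, b, size)
    return c


# f(X) = (1 + X)^2 and g(Y) = Y + Y^2, so f(g(Y)) = (1 + Y + Y^2)^2.
print(compose([1, 2, 1], [0, 1, 1], 6))
```

The printed list can be compared with the direct expansion $(1 + Y + Y^2)^2 = 1 + 2Y + 3Y^2 + 2Y^3 + Y^4$.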

Now let $k_0$ be a field, let $T$ be an indeterminate, and let $k_0((T))$ be the
corresponding field of formal Laurent series with coefficients in $k_0$ and poles
of finite order in $T$, as in Section 3.4. Also let $r$ be a positive real number
strictly less than $1$, and let $|\cdot|$ be the corresponding absolute value function on
$k_0((T))$, as before. If $f(X)$ is a formal power series in an indeterminate $X$ with
coefficients $a_j \in k_0$, as in (3.129), and if

(3.149)    $g(T) = \sum_{l=0}^{\infty} b_l\, T^l$

is a formal power series in $T$ with coefficients $b_l \in k_0$, then

(3.150)    $f(g(T)) = \sum_{j=0}^{\infty} a_j\, g(T)^j$

may be considered as an infinite series with terms in $k_0((T))$. Of course, if
$a_j = 0$ for all but finitely many $j$, so that $f(X)$ is a formal polynomial in $X$,
then (3.150) reduces to a finite sum, which makes sense for every $g(T) \in k_0((T))$.
Otherwise, if $a_j \neq 0$ for infinitely many $j$, then (3.150) converges in $k_0((T))$ with
respect to the ultrametric associated to the absolute value function $|\cdot|$ when
$|g(T)| < 1$, which means that $g(T) \in k_0[[T]]$ and $b_0 = 0$.


3.9 Changing centers 

Let $k$ be a field, let

(3.151)    $f(x) = \sum_{j=0}^{\infty} a_j\, x^j$

be a power series with coefficients in $k$, and let $b_0$ be an element of $k$. We would
like to consider

(3.152)    $f(b_0 + y) = \sum_{j=0}^{\infty} a_j\, (b_0 + y)^j$

as a power series in $y$, at least formally, which corresponds to (3.87) in Section
3.7 with $g(y) = b_0 + y$. Using the binomial theorem, we get that

(3.153)    $f(b_0 + y) = \sum_{j=0}^{\infty} \sum_{l=0}^{j} \binom{j}{l} \cdot a_j\, b_0^{j-l}\, y^l = \sum_{l=0}^{\infty} \Big( \sum_{j=l}^{\infty} \binom{j}{l} \cdot a_j\, b_0^{j-l} \Big)\, y^l$,

at least formally again. As usual, there is no problem with this when $a_j = 0$ for
all but finitely many $j$.
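In the polynomial case, the inner sums on the right side of (3.153) are finite and can be computed directly. The following Python sketch does this with integer or rational coefficients; the names `recenter` and `evaluate` are introduced only for this illustration, and `math.comb` requires Python 3.8 or later.

```python
from math import comb


def recenter(a, b0):
    """Coefficients of f(b0 + y) in powers of y, for the polynomial f with
    coefficient list a.

    The coefficient of y^l is the sum over j >= l of C(j, l) * a_j * b0^(j - l),
    which is the inner sum on the right side of (3.153)."""
    return [
        sum(comb(j, l) * a[j] * b0 ** (j - l) for j in range(l, len(a)))
        for l in range(len(a))
    ]


def evaluate(coeffs, x):
    """Finite sum of coeffs[j] * x**j."""
    return sum(c * x ** j for j, c in enumerate(coeffs))


a = [1, -3, 0, 2]  # f(x) = 1 - 3x + 2x^3
shifted = recenter(a, 2)
print(shifted)     # coefficients of f(2 + y) in powers of y
print(evaluate(shifted, 5) == evaluate(a, 2 + 5))
```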

Suppose that $k = \mathbf{R}$ or $\mathbf{C}$, with the standard absolute value function, and
that

(3.154)    $\sum_{j=0}^{\infty} |a_j|\, r^j$

converges for some $r > 0$. Of course, this implies that the series in (3.151)
converges absolutely when $x \in k$ satisfies $|x| \le r$. Suppose also that

(3.155)    $|b_0| + t \le r$

for some $t > 0$, so that the series in (3.152) converges absolutely when $y \in k$
satisfies $|y| \le t$. Using the binomial theorem again, we get that

(3.156)    $\sum_{j=0}^{\infty} \sum_{l=0}^{j} |a_j| \binom{j}{l} |b_0|^{j-l}\, t^l = \sum_{j=0}^{\infty} |a_j|\, (|b_0| + t)^j \le \sum_{j=0}^{\infty} |a_j|\, r^j < \infty$.

It follows that

(3.157)    $\sum_{l=0}^{\infty} \Big( \sum_{j=l}^{\infty} |a_j| \binom{j}{l} |b_0|^{j-l} \Big)\, t^l < \infty$.

In particular,

(3.158)    $\sum_{j=l}^{\infty} |a_j| \binom{j}{l} |b_0|^{j-l} < \infty$

for each $l \ge 0$, which means that the sum in $j$ on the right side of (3.153)
converges absolutely for each $l$. The finiteness of (3.157) implies that the sums
in (3.153) converge absolutely for every $y \in k$ with $|y| \le t$, and permits the
interchange of summation in the second step in (3.153), as in Section 2.9.

Now let $k$ be an arbitrary field with an ultrametric absolute value function
$|\cdot|$ such that $k$ is complete with respect to the associated ultrametric. Suppose
that

(3.159)    $\lim_{j \to \infty} |a_j|\, r^j = 0$

for some $r > 0$, so that the series in (3.151) converges in $k$ when $x \in k$ satisfies
$|x| \le r$. Suppose also that

(3.160)    $|b_0| \le r$,

which implies that the series in (3.152) converges in $k$ for every $y \in k$ with
$|y| \le r$. Observe that

(3.161)    $\Big| \binom{j}{l} \cdot a_j\, b_0^{j-l} \Big|\, r^l \le |a_j|\, |b_0|^{j-l}\, r^l \le |a_j|\, r^j$

for every $j \ge l \ge 0$, using the ultrametric version of the triangle inequality and
the fact that the binomial coefficients are integers in the first step. Thus

(3.162)    $\Big| \binom{j}{l} \cdot a_j\, b_0^{j-l} \Big|\, |y|^l \le |a_j|\, r^j$

for every $j \ge l \ge 0$ when $y \in k$ satisfies $|y| \le r$.

Put

(3.163)    $\widetilde{a}_l = \sum_{j=l}^{\infty} \binom{j}{l} \cdot a_j\, b_0^{j-l}$

for each $l \ge 0$, where the convergence of the series in $k$ follows from (3.159) and
(3.161). The ultrametric version of the triangle inequality implies that

(3.164)    $|\widetilde{a}_l| \le \max_{j \ge l} \Big| \binom{j}{l} \cdot a_j\, b_0^{j-l} \Big| \le \max_{j \ge l} (|a_j|\, |b_0|^{j-l}) \le \max_{j \ge l} (|a_j|\, r^{j-l})$

for each $l \ge 0$, and hence that

(3.165)    $|\widetilde{a}_l|\, r^l \le \max_{j \ge l} (|a_j|\, r^j)$

for each $l \ge 0$. It follows that

(3.166)    $\sum_{l=0}^{\infty} \widetilde{a}_l\, y^l$

converges in $k$ for every $y \in k$ with $|y| \le r$, because (3.165) tends to $0$ as $l \to \infty$,
by (3.159). Of course, (3.166) is the same as the right side of (3.153), and one
can check that (3.153) holds for every $y \in k$ with $|y| \le r$ under these conditions.
More precisely, this uses the fact that (3.162) tends to $0$ as $j \to \infty$, by (3.159),
in order to interchange the order of summation in the second step in (3.153), as
in Section 2.9.


3.10 The residue field 

Let k be a field with an ultrametric absolute value function | · |. Observe that
the closed unit ball

(3.167) \overline{B}(0, 1) = \{x \in k : |x| \le 1\}

in k is a subring of k, and that the open unit ball

(3.168) B(0, 1) = \{x \in k : |x| < 1\}

in k is an ideal in \overline{B}(0, 1). Thus the quotient

(3.169) \overline{B}(0, 1) / B(0, 1)

is defined as a commutative ring, and in fact it is a field, known as the residue
field associated to | · | on k. More precisely, the multiplicative identity element
1 in k satisfies |1| = 1, so that its image in the quotient is nonzero, which is
the multiplicative identity element in the quotient. An element x of \overline{B}(0, 1)
has a multiplicative inverse in \overline{B}(0, 1) exactly when |x| = 1, which implies that
nonzero elements of the quotient have multiplicative inverses in the quotient.

If | · | is the trivial absolute value function on k, then \overline{B}(0, 1) = k,
B(0, 1) = {0}, and hence the residue field is the same as k. If k = Q_p equipped
with the p-adic absolute value function for some prime number p, then
\overline{B}(0, 1) is the ring Z_p of p-adic integers, B(0, 1) = p Z_p, and the
residue field is isomorphic to Z/pZ, as in Section 3.5.

If |x| is an ultrametric absolute value function on any field k, then |x|^a is
also an ultrametric absolute value function on k for every positive real number
a, as in Section 1.5. The open and closed unit balls in k with respect to |x|^a are
the same as for |x| for each a > 0, which implies that the residue field associated
to |x|^a is the same as the residue field associated to |x|.

Let k be any field with an ultrametric absolute value function | · | again,
and let k_1 be a subfield of k. The restriction of | · | to k_1 is an absolute value
function on k_1, and it is easy to see that there is a natural induced injective
homomorphism from the residue field associated to k_1 into the residue field
associated to k. If k_1 is dense in k with respect to the ultrametric corresponding
to | · |, then one can check that the induced homomorphism between the residue
fields is surjective. In particular, the residue field associated to the completion of
a field with an ultrametric absolute value function is isomorphic to the residue
field associated to the original field in a natural way.
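
As an illustration of the last remark, which is not part of the original text,
the residue field of Q with respect to the p-adic absolute value can be computed
directly on rational numbers, and it agrees with the residue field Z/pZ of Q_p.
The following Python sketch uses exact rational arithmetic; the helper names
padic_abs and residue are hypothetical and chosen only for this example.

```python
from fractions import Fraction

def padic_abs(x, p):
    """p-adic absolute value |x|_p = p^(-v), where p^v exactly divides x."""
    x = Fraction(x)
    if x == 0:
        return Fraction(0)
    v = 0
    num, den = x.numerator, x.denominator
    while num % p == 0:
        num //= p
        v += 1
    while den % p == 0:
        den //= p
        v -= 1
    return Fraction(p) ** (-v)

def residue(x, p):
    """Image in Z/pZ of a rational x in the closed unit ball |x|_p <= 1."""
    x = Fraction(x)
    if padic_abs(x, p) > 1:
        raise ValueError("x is not in the closed unit ball")
    # The denominator is prime to p here, so it is invertible modulo p.
    return (x.numerator * pow(x.denominator, -1, p)) % p

a, b = Fraction(3, 4), Fraction(7, 2)
print(residue(a, 5), residue(b, 5), residue(a + b, 5), residue(a * b, 5))
```

Elements of the open unit ball, such as 10/3 when p = 5, are sent to 0, and the
map respects sums and products, as one expects from a quotient by an ideal.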

Suppose that k is a field with characteristic p for some prime number p, and
equipped with an ultrametric absolute value function | · |. Thus p · 1 = 0 in
k, which implies that the analogous statement holds in the associated residue
field, so that the residue field has characteristic p too. Alternatively, if k has
characteristic p, then there is a natural embedding of Z/pZ into k. Let k_1 be
the image of Z/pZ in k under this embedding, and note that the restriction of
| · | to k_1 is trivial, by (1.30) in Section 1.3. This implies that the residue field
associated to k_1 is isomorphic to Z/pZ as well, which leads to an embedding
of Z/pZ into the residue field associated to k, by the remarks in the preceding
paragraph.

Let k_0 be a field, let T be an indeterminate, and let |f(T)| be the absolute
value function on k_0((T)) associated to some r ∈ (0, 1), as in Section 3.4. The
corresponding closed unit ball in k_0((T)) is equal to k_0[[T]], the open unit ball
is equal to T k_0[[T]], and the residue field is isomorphic to k_0. Of course, k_0 can
also be identified with a subfield of k_0((T)).

Suppose for the moment that | · | is a nontrivial discrete ultrametric absolute
value function on a field k, and that the associated residue field has exactly
N elements for some integer N ≥ 2. As in Section 1.9, there is a ρ_1 ∈ (0, 1)
such that the nonzero values of | · | on k are the same as the integer powers
of ρ_1. Thus open balls in k of radius 1 are the same as closed balls of radius
ρ_1, so that \overline{B}(0, 1) can be expressed as the union of N pairwise-disjoint
closed balls of radius ρ_1. Using this, one can check that each closed ball in k of
radius ρ_1^j for some j ∈ Z can be expressed as the union of N pairwise-disjoint
closed balls of radius ρ_1^{j+1}. Repeating the process, we get that each closed ball
in k of radius ρ_1^j for some j ∈ Z can be expressed as the union of N^l
pairwise-disjoint closed balls of radius ρ_1^{j+l} for every l ∈ Z_+. In particular,
this implies that bounded subsets of k are totally bounded. If k is complete with
respect to the ultrametric associated to | · |, then it follows that closed and
bounded subsets of k are compact.

Let k be any field with an ultrametric absolute value function | · | again. If the
closed unit ball in k is totally bounded, then it is easy to see that the associated
residue field is finite, and one can also check that | · | has to be discrete on k
in this case. More precisely, the residue field is finite exactly when the closed
unit ball can be covered by finitely many open balls of radius 1, which can be
taken to be centered at points in \overline{B}(0, 1). Similarly, if the open unit ball
can be covered by finitely many closed balls of radius less than 1, which can be
taken to be centered at points in B(0, 1), then one can verify that | · | is discrete
on k. If k is locally compact with respect to the topology determined by the metric
associated to | · |, and if | · | is not the trivial absolute value function on k, then
the closed unit ball in k is compact. In particular, this implies that the closed
unit ball in k is totally bounded. Remember too that k is complete with respect
to the metric associated to | · | when k is locally compact.



Chapter 4 

Geometry of mappings 


4.1 Differentiation 


Let k be a field, and let | · | be an absolute value function on k. Also let E be a
subset of k, and let x be an element of E that is a limit point of E with respect
to the metric associated to | · |. Note that any interior point of E is a limit point
of E when | · | is not the trivial absolute value function on k, and that k has
no limit points when | · | is the trivial absolute value function on k. As usual, a
k-valued function f on E is said to be differentiable at x if the limit of

(4.1) \frac{f(y) - f(x)}{y - x}

as y ∈ E approaches x exists in k. In this case, the derivative f'(x) of f at x is
defined to be the value of this limit of (4.1). Equivalently, this means that

(4.2) \lim_{\substack{y \to x \\ y \in E}} \frac{f(y) - f(x) - f'(x) (y - x)}{y - x} = 0.

In particular, this implies that

(4.3) \lim_{\substack{y \to x \\ y \in E}} (f(y) - f(x) - f'(x) (y - x)) = 0,

and hence that

(4.4) \lim_{\substack{y \to x \\ y \in E}} (f(y) - f(x)) = 0,

so that f is continuous at x as a k-valued function on E.

Let g be another k-valued function on E, and suppose that f and g are
both differentiable at x. Under these conditions, it is easy to see that f + g is
differentiable at x too, with

(4.5) (f + g)'(x) = f'(x) + g'(x).

Similarly, one can check that f g is differentiable at x, with

(4.6) (f g)'(x) = f'(x) g(x) + f(x) g'(x),

as in the classical product rule for derivatives. More precisely, we have that

(4.7) \frac{f(y) g(y) - f(x) g(x)}{y - x} = \left(\frac{f(y) - f(x)}{y - x}\right) g(y) + f(x) \left(\frac{g(y) - g(x)}{y - x}\right)

for every y ∈ E with y ≠ x. This implies (4.6), by taking the limit as y → x of
both sides of (4.7), and using the fact that g is continuous at x as a k-valued
function on E to deal with the factor of g(y) in the first term on the right side
of (4.7). In particular, if a ∈ k, then a f is a k-valued function on E that is
differentiable at x, with

(4.8) (a f)'(x) = a f'(x).

This corresponds to the case where g is the constant function on E equal to a
at every point, so that g'(x) = 0, although one can verify (4.8) more directly
too.
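
The algebraic identity behind the product rule holds exactly for any pair of
functions and any y ≠ x, before any limit is taken. The short Python sketch
below is an added illustration and not part of the original text; it checks the
identity with exact rational arithmetic for two sample polynomials, and the
function names are hypothetical.

```python
from fractions import Fraction

def difference_quotient(h, x, y):
    """Difference quotient (h(y) - h(x)) / (y - x), for y != x."""
    return (h(y) - h(x)) / (y - x)

def product_rule_sides(f, g, x, y):
    """Left and right sides of the identity for the difference quotient of f g."""
    left = difference_quotient(lambda t: f(t) * g(t), x, y)
    right = (difference_quotient(f, x, y) * g(y)
             + f(x) * difference_quotient(g, x, y))
    return left, right

# Sample polynomials over Q; any functions would do.
f = lambda t: t**3 - 2 * t
g = lambda t: 5 * t**2 + 1
left, right = product_rule_sides(f, g, Fraction(1, 3), Fraction(2, 7))
print(left == right)
```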

In order to formulate the chain rule in this setting, let g be a k-valued
function defined on a set A ⊆ k, and let f be a k-valued function defined on a
set E ⊆ k. Thus the composition f ∘ g is defined on the set

(4.9) A \cap g^{-1}(E).

Let x be an element of (4.9), so that x ∈ A and g(x) ∈ E. Suppose also that
x is a limit point of (4.9), which implies in particular that x is a limit point of
A. In the other direction, if x is a limit point of A, g(x) is an element of the
interior of E, and g is continuous at x as a k-valued function on A, then x is
a limit point of (4.9) too. In addition, we would like g(x) to be a limit point
of E. This follows from the condition that x be a limit point of (4.9) when g
is continuous at x as a k-valued function on (4.9) and g is not constant on the
intersection of (4.9) with any neighborhood of x in k.

Suppose that g is differentiable at x as a k-valued function on A, or at least
on (4.9). This implies that g is continuous at x, as before, which is relevant for
some of the remarks in the preceding paragraph. If f is differentiable at g(x)
as a k-valued function on E, then one can verify that f ∘ g is differentiable at x
as a k-valued function on (4.9), with

(4.10) (f \circ g)'(x) = f'(g(x)) g'(x).

More precisely, the differentiability of f at g(x) implies that

(4.11) f(g(y)) - f(g(x)) - f'(g(x)) (g(y) - g(x))

is small compared to g(y) − g(x) when g(y) is close to g(x), for g(y) ∈ E and
hence for y in (4.9). Similarly, the differentiability of g at x means that

(4.12) g(y) - g(x) - g'(x) (y - x)

is small compared to y − x, for y in (4.9) close to x. Combining these two
statements, one can get that

(4.13) f(g(y)) - f(g(x)) - f'(g(x)) g'(x) (y - x)

is small compared to y − x, for y in (4.9) close to x, as desired. This uses the
differentiability of g at x to get that |g(y) − g(x)| is bounded by a constant times
|y − x| when y in (4.9) is close to x, so that (4.11) is small compared to y − x.


4.2 Mappings between metric spaces 

Let (M_1, d_1(x, y)) and (M_2, d_2(u, v)) be metric spaces, and let f be a mapping
from M_1 into M_2. Put

(4.14) D_r(f)(x) = r^{-1} \sup\{d_2(f(x), f(y)) : y \in M_1, d_1(x, y) \le r\}

for each x ∈ M_1 and r > 0, where the supremum is defined as a nonnegative
extended real number. Similarly, put

(4.15) \widetilde{D}_t(f)(x) = \sup_{0 < r \le t} D_r(f)(x)

for each x ∈ M_1 and t > 0, which is also defined as an extended real number.
Equivalently,

(4.16) \widetilde{D}_t(f)(x) = \sup\left\{\frac{d_2(f(x), f(y))}{d_1(x, y)} : y \in M_1, 0 < d_1(x, y) \le t\right\}

when there is a y ∈ M_1 such that 0 < d_1(x, y) ≤ t, and otherwise
\widetilde{D}_t(f)(x) = 0. This can be seen by taking r = d_1(x, y) in (4.15).

By construction, \widetilde{D}_t(f)(x) increases monotonically in t, and we put

(4.17) D(f)(x) = \limsup_{r \to 0} D_r(f)(x) = \inf_{t > 0} \widetilde{D}_t(f)(x)

for each x ∈ M_1. This is defined as a nonnegative extended real number as well,
which may be considered as the limit of \widetilde{D}_t(f)(x) as t → 0. If x ∈ M_1
is a limit point of M_1, then there are y ∈ M_1 as in (4.16) for each t > 0. In this
case, D(f)(x) may be expressed equivalently by

(4.18) D(f)(x) = \limsup_{y \to x} \frac{d_2(f(x), f(y))}{d_1(x, y)}.

Otherwise, if x is an isolated point in M_1, then D(f)(x) = 0.
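
To make these definitions concrete, here is a small Python sketch, added for
illustration and not part of the original text. It evaluates the two local
dilation quantities on a finite set of rational points, assuming the standard
metric on both sides and the convention d_1(x, y) ≤ r in the supremum. The names
D_r and D_tilde are hypothetical. Every point of a finite metric space is
isolated, so the second quantity is 0 once t is below the smallest positive
distance.

```python
from fractions import Fraction

def D_r(f, points, x, r):
    """r^{-1} sup{ |f(x) - f(y)| : y in points, |x - y| <= r }; needs x in points."""
    return max(abs(f(x) - f(y)) for y in points if abs(x - y) <= r) / r

def D_tilde(f, points, x, t):
    """sup of |f(x) - f(y)| / |x - y| over 0 < |x - y| <= t, or 0 if there is no such y."""
    quotients = [abs(f(x) - f(y)) / abs(x - y)
                 for y in points if 0 < abs(x - y) <= t]
    return max(quotients, default=Fraction(0))

# A finite grid in [-1, 1] and the sample function u -> u^2, examined at x = 0.
points = [Fraction(k, 8) for k in range(-8, 9)]
square = lambda u: u * u
x = Fraction(0)
print(D_r(square, points, x, Fraction(1, 2)), D_tilde(square, points, x, Fraction(1, 2)))
```

On this grid the supremum over 0 < r ≤ t of D_r is attained at one of the
finitely many distances from x, which is how the two descriptions agree.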

If D(f)(x) < A for some x ∈ M_1 and A ∈ R, then \widetilde{D}_t(f)(x) < A for some
t > 0, and hence

(4.19) d_2(f(x), f(y)) \le A d_1(x, y)

for every y ∈ M_1 with d_1(x, y) ≤ t. Conversely, if (4.19) holds for some x ∈ M_1,
A ≥ 0, t > 0, and every y ∈ M_1 with d_1(x, y) ≤ t, then we get that

(4.20) D(f)(x) \le \widetilde{D}_t(f)(x) \le A.

Thus D(f)(x) may be described as the infimum of the nonnegative real numbers
A for which there is a t > 0 such that (4.19) holds for every y ∈ M_1 with
d_1(x, y) ≤ t, at least when there is such an A. Otherwise, if there is no such A,
then D(f)(x) = +∞. Note that f is continuous at x when D(f)(x) < ∞, by
(4.19).

Let (M_3, d_3(w, z)) be another metric space, let f_1 be a mapping from M_1
into M_2, and let f_2 be a mapping from M_2 into M_3. Thus the composition
f_2 ∘ f_1 is defined as a mapping from M_1 to M_3, and we would like to show that

(4.21) D(f_2 \circ f_1)(x) \le D(f_2)(f_1(x)) D(f_1)(x)

for every x ∈ M_1 such that

(4.22) D(f_1)(x), D(f_2)(f_1(x)) < +\infty.

More precisely, D(f_1)(x) is defined in exactly the same way as before, while
D(f_2 ∘ f_1)(x) and D(f_2)(f_1(x)) are defined analogously for mappings from M_1
and M_2 into M_3, respectively. To do this, let such a point x ∈ M_1 be given,
and suppose that A_1, A_2 ∈ R satisfy

(4.23) D(f_1)(x) < A_1

and

(4.24) D(f_2)(f_1(x)) < A_2.

As in the preceding paragraph, (4.23) implies that there is a t_1 > 0 such that

(4.25) d_2(f_1(x), f_1(y)) \le A_1 d_1(x, y)

for every y ∈ M_1 with d_1(x, y) ≤ t_1. Similarly, (4.24) implies that there is a
t_2 > 0 such that

(4.26) d_3(f_2(f_1(x)), f_2(v)) \le A_2 d_2(f_1(x), v)

for every v ∈ M_2 with d_2(f_1(x), v) ≤ t_2. Put

(4.27) t_3 = \min(t_1, A_1^{-1} t_2).

If y ∈ M_1 satisfies d_1(x, y) ≤ t_3, then (4.25) implies that

(4.28) d_2(f_1(x), f_1(y)) \le A_1 d_1(x, y) \le A_1 t_3 \le t_2,

so that (4.26) holds with v = f_1(y). It follows that

(4.29) d_3(f_2(f_1(x)), f_2(f_1(y))) \le A_2 d_2(f_1(x), f_1(y)) \le A_2 A_1 d_1(x, y)

for every y ∈ M_1 with d_1(x, y) ≤ t_3, by (4.25) and (4.26). This shows that

(4.30) D(f_2 \circ f_1)(x) \le \widetilde{D}_{t_3}(f_2 \circ f_1)(x) \le A_2 A_1,

as in (4.20), where \widetilde{D}_{t_3}(f_2 ∘ f_1)(x) is defined in the same way as
before, but for mappings from M_1 into M_3. It is easy to get (4.21) from (4.30), by
taking the infimum over A_1, A_2 ∈ R that satisfy (4.23) and (4.24), respectively.





4.3 k-Valued functions


Let k be a field, and let | · | be an absolute value function on k. Also let E
be a subset of k, and let x be an element of E that is a limit point of E with
respect to the metric associated to | · |. If f is a k-valued function on E that is
differentiable at x, then

(4.31) \lim_{\substack{y \to x \\ y \in E}} \frac{|f(x) - f(y)|}{|x - y|} = |f'(x)|.

In particular, this implies that

(4.32) D(f)(x) = |f'(x)|

in the notation of the previous section, where M_1 = E and M_2 = k are equipped
with the metric associated to | · |. In the other direction, if any k-valued function
f on E satisfies

(4.33) D(f)(x) = 0,

then f is differentiable at x, with f'(x) = 0.

Now let (M_1, d_1(x, y)) be any metric space again, and let us take M_2 = k,
equipped with the metric associated to | · |. If f is any k-valued function on M_1
and a ∈ k, then it is easy to see that

(4.34) D(a f)(x) = |a| D(f)(x)

for every x ∈ M_1. More precisely, the right side of (4.34) should be interpreted
as being equal to +∞ when D(f)(x) = +∞ and a ≠ 0, and the right side of
(4.34) may be interpreted as being 0 when a = 0 and D(f)(x) = +∞. If g is
another k-valued function on M_1, then

(4.35) D(f + g)(x) \le D(f)(x) + D(g)(x)

for every x ∈ M_1, with the usual interpretations when D(f)(x) or D(g)(x) is
infinite. Similarly,

(4.36) D(f + g)(x) \le \max(D(f)(x), D(g)(x))

for every x ∈ M_1 when | · | is an ultrametric absolute value function on k.

Of course,

(4.37) f(y) g(y) - f(x) g(x) = (f(y) - f(x)) g(y) + f(x) (g(y) - g(x))

for every x, y ∈ M_1, which implies that

(4.38) |f(y) g(y) - f(x) g(x)| \le |f(y) - f(x)| |g(y)| + |f(x)| |g(y) - g(x)|.

If D(f)(x), D(g)(x) < ∞, then one can use (4.38) to show that

(4.39) D(f g)(x) \le D(f)(x) |g(x)| + |f(x)| D(g)(x).

More precisely, if D(g)(x) < ∞, then g is continuous at x, which permits one
to approximate |g(y)| in the first term on the right side of (4.38) by |g(x)|.
Similarly, if | · | is an ultrametric absolute value function on k, then (4.37)
implies that

(4.40) |f(y) g(y) - f(x) g(x)| \le \max(|f(y) - f(x)| |g(y)|, |f(x)| |g(y) - g(x)|)

for every x, y ∈ M_1. Using this, one can check that

(4.41) D(f g)(x) \le \max(D(f)(x) |g(x)|, |f(x)| D(g)(x))

when D(f)(x), D(g)(x) < ∞.

Suppose for the moment that x is a limit point of M_1, since otherwise D of
any function on M_1 is equal to 0 at x, and (4.40) and (4.41) are trivial. Thus

(4.42) \limsup_{y \to x} |g(y)|

is defined, which is equal to |g(x)| when |g| is continuous at x. If (4.42) is less
than or equal to |g(x)|, then |g| is said to be upper semicontinuous at x. If
f(x) = 0, then the computations in the previous paragraph can be simplified,
and it is easy to see that

(4.43) D(f g)(x) \le D(f)(x) \limsup_{y \to x} |g(y)|

when D(f)(x) and (4.42) are finite, even if D(g)(x) = +∞. Of course, there is
an analogous statement when g(x) = 0.

4.4 Lipschitz mappings 

Let (M_1, d_1(x, y)) and (M_2, d_2(u, v)) be metric spaces, and let α be a positive
real number. A mapping f : M_1 → M_2 is said to be Lipschitz of order α if there
is a nonnegative real number C such that

(4.44) d_2(f(x), f(y)) \le C d_1(x, y)^\alpha

for every x, y ∈ M_1. Note that f satisfies (4.44) with C = 0 if and only if f
is constant, and that Lipschitz mappings of any positive order are uniformly
continuous. One sometimes simply says that f is a Lipschitz mapping when f
is Lipschitz of order α = 1.

If f : M_1 → M_2 is Lipschitz of order α > 0 with constant C ≥ 0, then

(4.45) D_r(f)(x) \le C r^{\alpha - 1}

for every x ∈ M_1 and r > 0, where D_r(f)(x) is as in (4.14) in Section 4.2. This
implies that

(4.46) \widetilde{D}_t(f)(x) \le C t^{\alpha - 1}

for every t > 0 when α ≥ 1, where \widetilde{D}_t(f)(x) is as in (4.15). Of course,
(4.46) can also be derived from (4.16). It follows that

(4.47) D(f)(x) \le C

when α = 1, and that

(4.48) D(f)(x) = 0

when α > 1, where D(f)(x) is as in (4.17).

Let (M_3, d_3(w, z)) be another metric space, and suppose that f_1 : M_1 → M_2
is Lipschitz of order α_1 > 0 with constant C_1 ≥ 0, and that f_2 : M_2 → M_3 is
Lipschitz of order α_2 > 0 with constant C_2 ≥ 0. This implies that

(4.49) d_3(f_2(f_1(x)), f_2(f_1(y))) \le C_2 d_2(f_1(x), f_1(y))^{\alpha_2} \le C_2 C_1^{\alpha_2} d_1(x, y)^{\alpha_1 \alpha_2}

for every x, y ∈ M_1. Thus the composition f_2 ∘ f_1 is Lipschitz of order α_1 α_2 as
a mapping from M_1 into M_3, with constant C_2 C_1^{α_2}.
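
The way orders multiply and constants combine under composition can be checked
numerically on a simple example. The Python sketch below is an added
illustration and not part of the original text. It takes the sample mappings
f_1(x) = |x|^{1/2} and f_2(u) = 3 |u|^{1/2} on the real line, both Lipschitz of
order 1/2, and estimates the best constants on a finite grid. The helper name
holder_constant is hypothetical, and the computation uses floating point, so
comparisons should allow a small tolerance.

```python
import math
from itertools import combinations

def holder_constant(f, points, alpha):
    """Largest ratio |f(x) - f(y)| / |x - y|^alpha over distinct grid points."""
    return max(abs(f(x) - f(y)) / abs(x - y) ** alpha
               for x, y in combinations(points, 2))

f1 = lambda x: math.sqrt(abs(x))        # order 1/2 with constant C1 = 1
f2 = lambda u: 3.0 * math.sqrt(abs(u))  # order 1/2 with constant C2 = 3
composition = lambda x: f2(f1(x))       # expected order 1/4, constant C2 * C1**(1/2) = 3

points = [k / 4 for k in range(-8, 9)]
print(holder_constant(f1, points, 0.5), holder_constant(composition, points, 0.25))
```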

In some situations, we may have a mapping f : M_1 → M_2 that satisfies

(4.50) C^{-1} d_1(x, y)^\alpha \le d_2(f(x), f(y)) \le C d_1(x, y)^\alpha

for some α > 0 and C ≥ 1, and every x, y ∈ M_1. If α = 1, then f is said to be
bilipschitz with constant C. Note that f is bilipschitz with constant C = 1 if
and only if f is an isometric embedding. If α is any positive real number, then
(4.50) is equivalent to saying that f is Lipschitz of order α with constant C, and
that the inverse mapping f^{-1} is defined and Lipschitz of order 1/α on f(M_1),
with constant C^{1/α}.

Let k be a field with an absolute value function | · |, and let us take M_2 = k,
with the metric associated to | · |. Also let f_1 and f_2 be k-valued functions on
M_1 that are Lipschitz of order α > 0 with constants C_1 and C_2, respectively. It
is easy to see that f_1 + f_2 is Lipschitz of order α on M_1 too, with constant

(4.51) C_1 + C_2.

Similarly, if | · | is an ultrametric absolute value function on k, then f_1 + f_2 is
Lipschitz of order α on M_1 with constant

(4.52) \max(C_1, C_2).

If a ∈ k, then a f_1 is Lipschitz of order α on M_1 with constant

(4.53) |a| C_1.

If f_1 and f_2 are also bounded functions on M_1, then one can check that f_1 f_2 is
Lipschitz of order α on M_1, with constant

(4.54) C_1 \left(\sup_{x \in M_1} |f_2(x)|\right) + \left(\sup_{x \in M_1} |f_1(x)|\right) C_2,

using (4.38) in Section 4.3. In this case, if | · | is an ultrametric absolute value
function on k, then f_1 f_2 is Lipschitz of order α on M_1 with constant

(4.55) \max\left(C_1 \left(\sup_{x \in M_1} |f_2(x)|\right), \left(\sup_{x \in M_1} |f_1(x)|\right) C_2\right),

because of (4.40).

Let us now take M_2 = R, equipped with the standard metric, and let f be
a real-valued function on M_1. If

(4.56) f(x) \le f(y) + C d_1(x, y)^\alpha

for some α > 0 and C ≥ 0, and for every x, y ∈ M_1, then we also have that

(4.57) f(y) \le f(x) + C d_1(x, y)^\alpha

for every x, y ∈ M_1, by interchanging the roles of x and y. It follows that

(4.58) |f(x) - f(y)| = \max(f(x) - f(y), f(y) - f(x)) \le C d_1(x, y)^\alpha

for every x, y ∈ M_1, so that f is Lipschitz of order α with constant C. If

(4.59) d_1(x, y)^\alpha

is a metric on M_1, then

(4.60) f_{p,\alpha}(x) = d_1(x, p)^\alpha

satisfies (4.56) with C = 1 for every p ∈ M_1, because of the triangle inequality.
In particular, this holds for 0 < α ≤ 1 when d_1(x, y) is a metric on M_1, and for
every α > 0 when d_1(x, y) is an ultrametric on M_1, as in Section 1.2.

If d(x, y) is any metric on M_1 and 0 < b ≤ 1, then

(4.61) d_1(x, y) = d(x, y)^b

is also a metric on M_1, as in Section 1.2. In this case, (4.59) is a metric on M_1
when 0 < α ≤ 1/b, so that (4.60) is Lipschitz of order α with constant C = 1
when 0 < α ≤ 1/b. If b < 1 and M_1 has at least two elements, then this leads to
nonconstant real-valued functions on M_1 that are Lipschitz of order α > 1. Note
that a locally constant mapping f on any metric space M_1 satisfies (4.48) for
every x ∈ M_1. If M_1 is not connected, then there are locally constant mappings
on M_1 that are not constant on M_1.

4.5 Lipschitz mappings, continued 

Let k be a field, and let | · | be an absolute value function on k. If x, y ∈ k and
j is a positive integer, then

(4.62) (x - y) \sum_{l=0}^{j-1} x^l y^{j-l-1} = x^j - y^j,

by a standard computation. This implies that

(4.63) |x^j - y^j| \le |x - y| \sum_{l=0}^{j-1} |x|^l |y|^{j-l-1} \le j |x - y| (\max(|x|, |y|))^{j-1}.

If | · | is an ultrametric absolute value function on k, then we get that

(4.64) |x^j - y^j| \le |x - y| \max_{0 \le l \le j-1} (|x|^l |y|^{j-l-1}) \le |x - y| (\max(|x|, |y|))^{j-1}.
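
Both estimates for |x^j − y^j| can be tested with exact arithmetic on Q, using
the standard absolute value for the first and a p-adic absolute value for the
second. The Python sketch below is an added illustration and not part of the
original text; the helper names are hypothetical, and the sample points are
arbitrary.

```python
from fractions import Fraction

def padic_abs(x, p):
    """p-adic absolute value |x|_p = p^(-v), where p^v exactly divides x."""
    x = Fraction(x)
    if x == 0:
        return Fraction(0)
    v = 0
    num, den = x.numerator, x.denominator
    while num % p == 0:
        num //= p
        v += 1
    while den % p == 0:
        den //= p
        v -= 1
    return Fraction(p) ** (-v)

def archimedean_bound_holds(x, y, j):
    """|x^j - y^j| <= j |x - y| max(|x|, |y|)^(j-1), standard absolute value."""
    return abs(x**j - y**j) <= j * abs(x - y) * max(abs(x), abs(y)) ** (j - 1)

def ultrametric_bound_holds(x, y, j, p):
    """|x^j - y^j|_p <= |x - y|_p max(|x|_p, |y|_p)^(j-1), with no factor of j."""
    bound = padic_abs(x - y, p) * max(padic_abs(x, p), padic_abs(y, p)) ** (j - 1)
    return padic_abs(x**j - y**j, p) <= bound

samples = [Fraction(n, d) for n in range(-6, 7) for d in (1, 2, 3, 9)]
print(all(archimedean_bound_holds(x, y, j)
          for x in samples for y in samples for j in range(1, 6)))
print(all(ultrametric_bound_holds(x, y, j, 3)
          for x in samples for y in samples for j in range(1, 6)))
```

For instance, with x = 1, y = 4, j = 3, and p = 3, one has x^3 − y^3 = −63, whose
3-adic absolute value 1/9 is well below the ultrametric bound 1/3.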

Let a_0, a_1, a_2, a_3, ... be a sequence of elements of k, and suppose for the
moment that the series

(4.65) \sum_{j=1}^\infty j |a_j| r^{j-1}

converges for some positive real number r. This implies that the series

(4.66) \sum_{j=0}^\infty |a_j| r^j

converges, so that the corresponding power series

(4.67) \sum_{j=0}^\infty a_j x^j

converges absolutely when |x| ≤ r. If k is complete with respect to the metric
associated to | · |, then it follows that (4.67) converges in k, and we let f(x)
denote the value of the sum. Of course, if a_j = 0 for all but finitely many j,
then the completeness of k is not needed, and we still let f(x) denote the value
of the sum (4.67). In both cases, if x, y ∈ k and |x|, |y| ≤ r, then we get that

(4.68) |f(x) - f(y)| \le \sum_{j=1}^\infty |a_j| |x^j - y^j| \le \left(\sum_{j=1}^\infty j |a_j| r^{j-1}\right) |x - y|,

by (4.63).

Suppose now that | · | is an ultrametric absolute value function on k, and
that the a_j's satisfy

(4.69) \lim_{j \to \infty} |a_j| r^j = 0

for some r > 0, instead of (4.65). If k is complete, then it follows that the power
series (4.67) converges in k for each x ∈ k with |x| ≤ r, and we let f(x) denote
the value of the sum again. As before, the completeness of k is not needed when
a_j = 0 for all but finitely many j. In both cases, if x, y ∈ k and |x|, |y| ≤ r, then
we have that

(4.70) |f(x) - f(y)| \le \max_{j \ge 1} (|a_j| |x^j - y^j|) \le \max_{j \ge 1} (|a_j| r^{j-1}) |x - y|,

by (4.64).





4.6 Differentiation, continued 

Let k be a field, and let

(4.71) f(X) = \sum_{j=0}^\infty a_j X^j

be a formal power series with coefficients in k. The formal derivative of f(X)
is defined to be the formal power series

(4.72) f'(X) = \sum_{j=1}^\infty j \cdot a_j X^{j-1}.

If

(4.73) g(X) = \sum_{j=0}^\infty b_j X^j

is another formal power series with coefficients in k, then the sum (f + g)(X) =
f(X) + g(X) is a formal power series too, whose derivative is given by

(4.74) (f + g)'(X) = f'(X) + g'(X).

Similarly, if a ∈ k, then (a f)(X) = a f(X) is a formal power series, whose
derivative is given by

(4.75) (a f)'(X) = a f'(X).

Suppose for the moment that k = R or C, with the standard absolute value
function. Suppose also that the series (4.65) converges for some positive real
number r, so that the power series

(4.76) \sum_{j=1}^\infty j \cdot a_j x^{j-1}

converges absolutely for every x ∈ k with |x| ≤ r. The convergence of the series
(4.65) implies that the series (4.66) converges as well, and hence that f(x) can
be defined as a k-valued function on the closed ball

(4.77) \overline{B}(0, r) = \{x \in k : |x| \le r\}

by the power series (4.67). If x ∈ k and |x| ≤ r, then one can check that

(4.78) \frac{f(y) - f(x)}{y - x}

tends to (4.76) as y ∈ k with |y| ≤ r tends to x. Thus the derivative f'(x) of f
at x exists and is equal to (4.76), for f as a k-valued function defined on
\overline{B}(0, r). This is elementary when f is a polynomial, and otherwise one
should be careful about interchanging the order of the limits for the infinite sum.
The main point is that the errors in the relevant approximations can be estimated
as in (4.68).





If (4.71) has radius of convergence ρ > 0, then the series (4.66) converges
when r < ρ. It is well known that the series (4.65) converges when r < ρ too,
which can be derived from the fact that the series

(4.79) \sum_{j=0}^\infty |a_j| t^j

converges when r < t < ρ. Thus f(x) can be defined as a k-valued function on
the open ball

(4.80) B(0, \rho) = \{x \in k : |x| < \rho\}

by the power series (4.67), and the power series (4.76) converges absolutely at
every point in B(0, ρ) as well. If x ∈ k and |x| < ρ, then (4.78) tends to (4.76)
as y ∈ k tends to x. This follows from the analogous statement in the preceding
paragraph applied to r such that |x| < r < ρ. In this case, we do not need to
explicitly restrict our attention to |y| ≤ r, since this holds automatically when y
is sufficiently close to x. As before, this means that the derivative f'(x) of f at
x exists and is equal to (4.76), for f as a k-valued function defined on B(0, ρ).

Suppose now that k is any field equipped with an ultrametric absolute value
function | · |, and that the a_j's satisfy (4.69) for some r > 0. This implies that

(4.81) \lim_{j \to \infty} |j \cdot a_j| r^{j-1} = 0,

since |j · a_j| ≤ |a_j| for each j ∈ Z_+, by the ultrametric version of the triangle
inequality. If k is complete with respect to the ultrametric associated to | · |,
then it follows that the power series (4.67) and (4.76) converge in k for every
x ∈ k with |x| ≤ r. As usual, these sums also make sense when a_j = 0 for all
but finitely many j, even if k is not complete. In both cases, f(x) can be defined
as a k-valued function on the closed ball \overline{B}(0, r) by the power series
(4.67). If | · | is not the trivial absolute value function on k, then (4.78) tends to
(4.76) as y ∈ k tends to x ∈ k with |x| ≤ r, for essentially the same reasons as
before. Thus the derivative f'(x) of f at x exists and is given by the power series
(4.76) under these conditions.


4.7 Derivative 0 

Let k be a field, and let f(X) be a formal power series with coefficients in k,
as in (4.71). Thus the formal derivative f'(X) is equal to 0 as a formal power
series if and only if

(4.82) j \cdot a_j = 0

for every positive integer j. If k has characteristic 0, then this is the same as
saying that

(4.83) a_j = 0

for every j ≥ 1. Otherwise, if k has characteristic p for some prime number
p, then (4.82) holds for every j ≥ 1 if and only if (4.83) holds when j is not a
multiple of p.
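
The characteristic p criterion is easy to test for polynomials over Z/pZ, viewed
as formal power series with finitely many nonzero coefficients. The Python sketch
below is an added illustration and not part of the original text; coefficients
are stored as a list with coeffs[j] the coefficient of X^j, and the function
names are hypothetical.

```python
from itertools import product

def formal_derivative(coeffs, p):
    """Coefficients of f'(X) over Z/pZ, where f(X) = sum_j coeffs[j] X^j."""
    return [(j * a) % p for j, a in enumerate(coeffs)][1:]

def has_zero_derivative(coeffs, p):
    """True when f'(X) = 0 as a polynomial over Z/pZ."""
    return all(c == 0 for c in formal_derivative(coeffs, p))

def supported_on_multiples_of_p(coeffs, p):
    """True when coeffs[j] vanishes modulo p for every j not divisible by p."""
    return all(a % p == 0 for j, a in enumerate(coeffs) if j % p != 0)

p = 3
f = [1, 0, 0, 2, 0, 0, 1]   # 1 + 2 X^3 + X^6, a polynomial in X^3
g = [0, 0, 1]               # X^2
print(has_zero_derivative(f, p), has_zero_derivative(g, p))

# Exhaustive comparison of the two conditions for all polynomials of degree < 5.
agree = all(has_zero_derivative(list(c), p) == supported_on_multiples_of_p(list(c), p)
            for c in product(range(p), repeat=5))
print(agree)
```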





Let | · | be an absolute value function on k, and suppose that k is complete
with respect to the associated metric. Suppose also that f has positive radius of
convergence, and that a_j ≠ 0 for some j ∈ Z_+. Let j_0 be the smallest positive
integer with this property, and note that the power series

(4.84) \sum_{j=j_0}^\infty a_j x^{j-j_0}

has the same radius of convergence as f. Thus (4.84) is defined and continuous
as a k-valued function on a ball in k centered at 0 with positive radius. This
implies that (4.84) is not equal to 0 when x ∈ k and |x| is sufficiently small,
because (4.84) is equal to a_{j_0} ≠ 0 when x = 0. It follows that

(4.85) f(x) = a_0 + x^{j_0} \sum_{j=j_0}^\infty a_j x^{j-j_0}

is different from f(0) = a_0 when |x| is sufficiently small and x ≠ 0. This shows
that f(x) is not constant on any neighborhood of 0 when | · | is not the trivial
absolute value function on k.

Let k be a field with characteristic p for some prime number p. It is well
known that

(4.86) (x + y)^p = x^p + y^p

for every x, y ∈ k, by the binomial theorem, because \binom{p}{j} is divisible by p
when 1 ≤ j < p. Similarly,

(4.87) (x - y)^p = x^p - y^p

for every x, y ∈ k, because (−1)^p = −1 automatically when p is odd, and
(−1)^p = 1 = −1 when p = 2. If | · | is any absolute value function on k, then it
follows that

(4.88) |x^p - y^p| = |x - y|^p

for every x, y ∈ k. Thus x ↦ x^p defines a Lipschitz mapping of order p from
k into itself, with respect to the metric on k associated to the absolute value
function. More precisely, this corresponds to (4.50) in Section 4.4, with α = p
and C = 1. Remember that any absolute value function on k is non-archimedean,
as in Section 1.6.
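
The divisibility of the binomial coefficients and the resulting additivity of
x ↦ x^p can be confirmed directly in the simplest case k = Z/pZ. The Python
sketch below is an added illustration and not part of the original text; the
function names are hypothetical.

```python
from math import comb

def middle_binomials_divisible(n):
    """True when n divides the binomial coefficient C(n, j) for every 0 < j < n."""
    return all(comb(n, j) % n == 0 for j in range(1, n))

def frobenius_respects_sums_and_differences(p):
    """Check (x + y)^p = x^p + y^p and (x - y)^p = x^p - y^p in Z/pZ."""
    return all(pow(x + y, p, p) == (pow(x, p, p) + pow(y, p, p)) % p
               and pow(x - y, p, p) == (pow(x, p, p) - pow(y, p, p)) % p
               for x in range(p) for y in range(p))

for p in (2, 3, 5, 7, 11):
    print(p, middle_binomials_divisible(p), frobenius_respects_sums_and_differences(p))
```

The primality of p matters here: for n = 4 the coefficient C(4, 2) = 6 is not
divisible by 4.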

If f(X) is a formal power series with coefficients in k such that f'(X) = 0
as a formal power series, then f(X) can be expressed as

(4.89) f(X) = g(X^p)

for some other formal power series g, by the remarks at the beginning of the
section. Suppose again that k is complete with respect to the metric associated
to | · |, and that f has positive radius of convergence. This implies that g has
positive radius of convergence too, which is equal to the pth power of the radius
of convergence of f. As in Section 4.5, the function corresponding to g satisfies
Lipschitz conditions of order 1 on closed balls in k centered at 0 with suitable
radii. This leads to Lipschitz conditions of order p for the function corresponding
to f on closed balls in k centered at 0 with suitable radii, because of (4.88). If
g' = 0 as a formal power series, then one can repeat the process. Of course,
the process can only be repeated finitely many times, unless a_j = 0 for every
j ∈ Z_+.

4.8 Some related estimates 

Let k be a field, and let j be an integer with j ≥ 2. Observe that

(4.90) x^j - y^j - j \cdot (x - y) y^{j-1} = (x - y) \sum_{l=0}^{j-1} x^l y^{j-l-1} - j \cdot (x - y) y^{j-1}
                                           = (x - y) \sum_{l=0}^{j-1} (x^l - y^l) y^{j-l-1},

for each x, y ∈ k, using (4.62) in Section 4.5 in the first step. If | · | is an absolute
value function on k, then we get that

(4.91) |x^j - y^j - j \cdot (x - y) y^{j-1}| \le |x - y| \sum_{l=0}^{j-1} |x^l - y^l| |y|^{j-l-1}

for every x, y ∈ k. Remember that

(4.92) |x^l - y^l| \le l |x - y| (\max(|x|, |y|))^{l-1}

for every x, y ∈ k and l ∈ Z_+, as in (4.63) in Section 4.5. Combining (4.91) and
(4.92), we obtain that

(4.93) |x^j - y^j - j \cdot (x - y) y^{j-1}| \le |x - y|^2 \sum_{l=1}^{j-1} l (\max(|x|, |y|))^{j-2}
                                             = \frac{j (j - 1)}{2} |x - y|^2 (\max(|x|, |y|))^{j-2}

for every x, y ∈ k. Similarly, if | · | is an ultrametric absolute value function on
k, then (4.90) implies that

(4.94) |x^j - y^j - j \cdot (x - y) y^{j-1}| \le |x - y| \max_{0 \le l \le j-1} (|x^l - y^l| |y|^{j-l-1})

for every x, y ∈ k. In this case, we also have that

(4.95) |x^l - y^l| \le |x - y| (\max(|x|, |y|))^{l-1}

for every x, y ∈ k and l ∈ Z_+, as in (4.64) in Section 4.5. It follows that

(4.96) |x^j - y^j - j \cdot (x - y) y^{j-1}| \le |x - y|^2 (\max(|x|, |y|))^{j-2}

for every x, y ∈ k under these conditions, by combining (4.94) and (4.95).
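
These second-order estimates can also be checked with exact arithmetic on Q,
with the standard absolute value in the first case and a p-adic absolute value
in the second. The Python sketch below is an added illustration and not part of
the original text; the helper names and sample points are hypothetical choices.

```python
from fractions import Fraction

def padic_abs(x, p):
    """p-adic absolute value |x|_p = p^(-v), where p^v exactly divides x."""
    x = Fraction(x)
    if x == 0:
        return Fraction(0)
    v = 0
    num, den = x.numerator, x.denominator
    while num % p == 0:
        num //= p
        v += 1
    while den % p == 0:
        den //= p
        v -= 1
    return Fraction(p) ** (-v)

def remainder(x, y, j):
    """x^j - y^j - j (x - y) y^(j-1), for an integer j >= 2."""
    return x**j - y**j - j * (x - y) * y ** (j - 1)

def archimedean_bound_holds(x, y, j):
    """Bound with the factor j (j - 1) / 2, standard absolute value."""
    bound = Fraction(j * (j - 1), 2) * abs(x - y) ** 2 * max(abs(x), abs(y)) ** (j - 2)
    return abs(remainder(x, y, j)) <= bound

def ultrametric_bound_holds(x, y, j, p):
    """Bound without the factor j (j - 1) / 2, p-adic absolute value."""
    bound = padic_abs(x - y, p) ** 2 * max(padic_abs(x, p), padic_abs(y, p)) ** (j - 2)
    return padic_abs(remainder(x, y, j), p) <= bound

samples = [Fraction(n, d) for n in range(-5, 6) for d in (1, 2, 3)]
print(all(archimedean_bound_holds(x, y, j)
          for x in samples for y in samples for j in range(2, 7)))
print(all(ultrametric_bound_holds(x, y, j, 3)
          for x in samples for y in samples for j in range(2, 7)))
```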

Suppose for the moment that k = R or C, with the standard absolute value
function. Let a_0, a_1, a_2, a_3, ... be a sequence of elements of k such that the series

(4.97) \sum_{j=2}^\infty j (j - 1) |a_j| r^{j-2}

converges for some positive real number r. Thus the series (4.65) and (4.66) in
Section 4.5 converge too, which implies that the power series (4.67) in Section
4.5 and (4.76) in Section 4.6 converge absolutely when x ∈ k satisfies |x| ≤ r.
Let f(x) be the k-valued function defined on the closed ball \overline{B}(0, r) by
the power series (4.67), whose derivative f'(x) is given by the power series (4.76),
as in Section 4.6. Note that

(4.98) f(x) - f(y) - f'(y) (x - y) = \sum_{j=2}^\infty a_j (x^j - y^j - j \cdot (x - y) y^{j-1})

for every x, y ∈ k with |x|, |y| ≤ r, where the contributions from the j = 0 and
j = 1 terms automatically cancel. Hence

(4.99) |f(x) - f(y) - f'(y) (x - y)| \le \sum_{j=2}^\infty |a_j| |x^j - y^j - j \cdot (x - y) y^{j-1}|

for every x, y ∈ k with |x|, |y| ≤ r. Combining this with (4.93), we get that

(4.100) |f(x) - f(y) - f'(y) (x - y)| \le \frac{1}{2} \left(\sum_{j=2}^\infty j (j - 1) |a_j| r^{j-2}\right) |x - y|^2

for every x, y ∈ k with |x|, |y| ≤ r.

Now let | · | be an ultrametric absolute value function on any field k, and let
a_0, a_1, a_2, a_3, ... be a sequence of elements of k that satisfies (4.69) in Section
4.5 for some r > 0. If k is complete with respect to the ultrametric associated
to | · |, then the power series (4.67) and (4.76) converge in k for every x ∈ k
with |x| ≤ r, as before. Let the sums of these series be denoted f(x) and f'(x),
respectively, which can also be defined when a_j = 0 for all but finitely many j,
even if k is not complete, as usual. As in Section 4.6, f'(x) is the derivative of
f(x) as a k-valued function on the closed ball \overline{B}(0, r) when | · | is
nontrivial on k. Otherwise, if | · | is the trivial absolute value function on k, then
one can still define f'(x) by the power series (4.76), but the estimates that follow
would be trivial.

Of course, (4.98) holds in this situation too, so that 

(4.101) \f{x) - f{y) - f'{y) ix-y)\< max(|aj| \x^ - y^ - j ■ {x-y)y^~^\) 



80 


CHAPTER 4. GEOMETRY OF MAPPINGS 


for every x,y G k with |a:|, |?/| < r, by the ultrametric version of the triangle 
inequality. Combining this with (4.96), we obtain that 

(4.102) |/(x) - fiy) - f'iy) {x - y)\ < |x - y\^ 

J>2 

for every x,y G k with |a;|, |?/| < r. In particular, it follows that 

(4.103) |/(a;)-/(j/)| < max{\f{x)-f{y)-fiy){x-y)\,\f{y)\\x-y\) 

< max(^max{\aj\G~‘^)\x-y\,\f{y)\j \x - y\ 

for every x,y G k with |a;|, |?/| < r. Note that 

(4.104) \f{x)\ < max(|j • a^l r^“^) < max(|aj| r^-^) 

j>i j>i 

for every x G k with |a;| < r, by the definition (4.76) of f'{x) and the ultrametric 
version of the triangle inequality. One can also check that 

(4.105) \f'{x) - f'{y)\ < max(|j • a^lr-’”^) \x - y\ < max(|aj| r^"^) \x - y\ 

i>2 i>2 

for every x,y G k with |a;|, |y| < r, by applying (4.70) in Section 4.5 to /' instead 
of/. 


4.9 Some related estimates, continued 

Let us continue with the same notation and hypotheses as at the end of the
preceding section. Also let $x_0 \in k$ and $t > 0$ be given, with $|x_0| \le r$ and $t \le r$,
so that

(4.106) $\overline{B}(x_0, t) \subseteq \overline{B}(x_0, r) = \overline{B}(0, r)$,

by the ultrametric version of the triangle inequality. If $y \in \overline{B}(x_0, t)$, then we
get that

(4.107) $|f'(y) - f'(x_0)| \le \max_{j \ge 2} (|a_j|\, r^{j-2})\, t$,

by (4.105) applied to $x = x_0$. In particular,

(4.108) $|f'(y)| \le \max\big(|f'(x_0)|,\ \max_{j \ge 2} (|a_j|\, r^{j-2})\, t\big)$

when $y \in \overline{B}(x_0, t)$. Combining this with (4.103), we obtain that

(4.109) $|f(x) - f(y)| \le \max\big(|f'(x_0)|,\ \max_{j \ge 2} (|a_j|\, r^{j-2})\, t\big)\, |x - y|$

for every $x, y \in \overline{B}(x_0, t)$, since $|x - y| \le t$ in this case too.

Put

(4.110) $g_0(x) = f(x) - f(x_0) - f'(x_0)\, (x - x_0)$

for each $x \in \overline{B}(0, r)$, so that

(4.111) $|g_0(x)| \le \max_{j \ge 2} (|a_j|\, r^{j-2})\, |x - x_0|^2$

for every $x \in \overline{B}(0, r)$, by (4.102). We also have that

(4.112) $g_0(x) - g_0(y) = f(x) - f(y) - f'(x_0)\, (x - y)$
        $= f(x) - f(y) - f'(y)\, (x - y) + (f'(y) - f'(x_0))\, (x - y)$

for every $x, y \in \overline{B}(0, r)$, and hence

(4.113) $|g_0(x) - g_0(y)| \le \max(|f(x) - f(y) - f'(y)\, (x - y)|,\ |f'(y) - f'(x_0)|\, |x - y|)$.

It follows that

(4.114) $|g_0(x) - g_0(y)| \le \max_{j \ge 2} (|a_j|\, r^{j-2})\, \max(|x - y|, |y - x_0|)\, |x - y|$

for every $x, y \in \overline{B}(0, r)$, by (4.102) and (4.105). If $x, y \in \overline{B}(x_0, t)$, so that
$|x - y| \le t$ too, then (4.114) implies that

(4.115) $|g_0(x) - g_0(y)| \le \max_{j \ge 2} (|a_j|\, r^{j-2})\, t\, |x - y|$.

As in Section 3.9, we can reexpress $f(x) = f(x_0 + (x - x_0))$ as a power series
in $x - x_0$, which corresponds to taking $b_0 = x_0$ and $y = x - x_0$ in the earlier
notation. More precisely, we have that

(4.116) $f(x) = \sum_{j=0}^\infty a_j\, (x_0 + (x - x_0))^j = \sum_{j=0}^\infty \sum_{l=0}^j a_j \cdot \binom{j}{l} \cdot x_0^{j-l}\, (x - x_0)^l$

for each $x \in k$ with $|x| \le r$, using the binomial theorem in the last step. Put

(4.117) $\tilde{a}_l = \sum_{j=l}^\infty a_j \cdot \binom{j}{l} \cdot x_0^{j-l}$

for each nonnegative integer $l$, where the series converges in $k$ for the same
reasons as before. We have also seen that

(4.118) $|\tilde{a}_l|\, r^l \le \max_{j \ge l} (|a_j|\, r^j)$

for each $l \ge 0$, and that

(4.119) $f(x) = \sum_{l=0}^\infty \tilde{a}_l\, (x - x_0)^l$

for every $x \in k$ with $|x| \le r$. Of course, $\tilde{a}_0 = f(x_0)$ and $\tilde{a}_1 = f'(x_0)$, so that

(4.120) $g_0(x) = \sum_{l=2}^\infty \tilde{a}_l\, (x - x_0)^l$

for every $x \in k$ with $|x| \le r$.
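The recentering formula (4.117) can be illustrated for a polynomial with integer coefficients, viewed $p$-adically with $r = 1$. The sketch below is only an illustration under those assumptions; the function names `recenter`, `evaluate`, and `padic_abs`, and the sample polynomial $5 + 3x + 2x^3$, are hypothetical choices and do not come from the text.

```python
from fractions import Fraction
from math import comb


def padic_abs(n, p):
    """p-adic absolute value of an integer n, with |0|_p = 0."""
    if n == 0:
        return Fraction(0)
    v = 0
    while n % p == 0:
        n //= p
        v += 1
    return Fraction(1, p**v)


def recenter(coeffs, x0):
    """Coefficients a~_l = sum_{j >= l} a_j * C(j, l) * x0^(j - l), as in (4.117)."""
    n = len(coeffs)
    return [
        sum(coeffs[j] * comb(j, l) * x0 ** (j - l) for j in range(l, n))
        for l in range(n)
    ]


def evaluate(coeffs, x, center=0):
    """Evaluate sum_l coeffs[l] * (x - center)^l."""
    return sum(a * (x - center) ** l for l, a in enumerate(coeffs))


a = [5, 3, 0, 2]            # f(x) = 5 + 3 x + 2 x^3 (illustrative)
a_tilde = recenter(a, 2)    # expansion of the same f around x0 = 2

# (4.118) with r = 1 and p = 3: |a~_l| <= max_{j >= l} |a_j|.
bound_holds = all(
    padic_abs(a_tilde[l], 3) <= max(padic_abs(c, 3) for c in a[l:])
    for l in range(len(a))
)
print(a_tilde, bound_holds)
```

Here `a_tilde[0]` and `a_tilde[1]` are $f(x_0)$ and $f'(x_0)$, in agreement with the remark before (4.120).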

If $x, y \in \overline{B}(x_0, t)$, then one can check that

(4.121) $|f(x) - f(y)| \le \max_{l \ge 1} (|\tilde{a}_l|\, t^{l-1})\, |x - y|$,

using the same type of estimate as in (4.70) in Section 4.5, applied to the
expansion (4.119). Equivalently,

(4.122) $|f(x) - f(y)| \le \max\big(|f'(x_0)|,\ \max_{l \ge 2} (|\tilde{a}_l|\, t^{l-1})\big)\, |x - y|$

for every $x, y \in \overline{B}(x_0, t)$, since $\tilde{a}_1 = f'(x_0)$. We also have that

(4.123) $\max_{l \ge 2} (|\tilde{a}_l|\, t^{l-1}) \le \max_{l \ge 2} (|\tilde{a}_l|\, r^{l-2})\, t \le \max_{j \ge 2} (|a_j|\, r^{j-2})\, t$,

using the fact that $t \le r$ in the first step, and (4.118) in the second step.
This gives another way to look at (4.109), by combining (4.122) and (4.123).
Similarly, one can verify that

(4.124) $|g_0(x) - g_0(y)| \le \max_{l \ge 2} (|\tilde{a}_l|\, t^{l-1})\, |x - y|$

for every $x, y \in \overline{B}(x_0, t)$. As before, this uses the same type of estimate as in
(4.70) in Section 4.5, applied to the expansion (4.120). This gives another way
to look at (4.115), by combining (4.124) and (4.123).


4.10 Hensel’s lemma 


Let us continue with the same notation and hypotheses as in the preceding
section again. Suppose for the moment that $x, y \in k$ satisfy $|x|, |y| \le r$ and

(4.125) $\max_{j \ge 2} (|a_j|\, r^{j-2})\, |x - y| < |f'(y)|$.

This implies that

(4.126) $|f'(x) - f'(y)| < |f'(y)|$,

by (4.105) in Section 4.8. It follows that

(4.127) $|f'(x)| = |f'(y)|$,

as in (1.43) in Section 1.3. Similarly, let us check that

(4.128) $|f(x) - f(y)| = |f'(y)|\, |x - y|$

when $x, y$ satisfy (4.125). Of course, (4.128) is trivial when $x = y$, and so we
may suppose that $x \ne y$. In this case, we can multiply (4.125) by $|x - y|$, to get
that

(4.129) $\max_{j \ge 2} (|a_j|\, r^{j-2})\, |x - y|^2 < |f'(y)|\, |x - y|$.

Combining this with (4.102) in Section 4.8, we obtain that

(4.130) $|f(x) - f(y) - f'(y)\, (x - y)| < |f'(y)|\, |x - y|$.

This implies (4.128), as in (1.43) in Section 1.3 again.

Let $x_0 \in k$ and $t > 0$ be as in the previous section, and suppose from now
on in this section that

(4.131) $t\, \max_{j \ge 2} (|a_j|\, r^{j-2}) < |f'(x_0)|$.

This implies that every $x \in \overline{B}(x_0, t)$ satisfies (4.125) with $y = x_0$. It follows
that

(4.132) $|f'(x)| = |f'(x_0)|$

for every $x \in \overline{B}(x_0, t)$, by (4.127) applied to $y = x_0$. Of course, this is the same
as saying that

(4.133) $|f'(y)| = |f'(x_0)|$

for every $y \in \overline{B}(x_0, t)$. If $x, y \in \overline{B}(x_0, t)$, then $|x - y| \le t$, and (4.131) implies
that (4.125) holds, because of (4.133). Thus (4.128) implies that

(4.134) $|f(x) - f(y)| = |f'(x_0)|\, |x - y|$,

using (4.133) again.

It follows that

(4.135) $f(\overline{B}(x_0, t)) \subseteq \overline{B}(f(x_0), |f'(x_0)|\, t)$

under these conditions, and in fact we have that

(4.136) $f(\overline{B}(x_0, t)) = \overline{B}(f(x_0), |f'(x_0)|\, t)$

when $k$ is complete with respect to the ultrametric associated to $|\cdot|$. This
is basically Hensel's lemma, as in [2, 12]. To prove (4.136), we shall use the
contraction mapping theorem. Note that the completeness of $k$ is important
here, even when $a_j = 0$ for all but finitely many $j$.

Let $z$ be an element of the right side of (4.136), so that $z \in k$ satisfies

(4.137) $|f(x_0) - z| \le |f'(x_0)|\, t$.

Put

(4.138) $h_z(x) = x_0 + f'(x_0)^{-1}\, (z - f(x_0)) - f'(x_0)^{-1}\, g_0(x)$

for each $x \in \overline{B}(x_0, t)$, where $g_0(x)$ is as in (4.110). Thus

(4.139) $f'(x_0)\, (x - h_z(x)) = f'(x_0)\, (x - x_0) - z + f(x_0) + g_0(x) = f(x) - z$

for every $x \in \overline{B}(x_0, t)$, by the definition (4.110) of $g_0(x)$. It is easy to see that
$h_z$ is Lipschitz of order 1 with constant

(4.140) $|f'(x_0)|^{-1}\, \max_{j \ge 2} (|a_j|\, r^{j-2})\, t$

on $\overline{B}(x_0, t)$, by (4.115). The hypothesis (4.131) says exactly that (4.140) is
strictly less than 1. We also have that

(4.141) $|h_z(x) - x_0| \le |f'(x_0)|^{-1}\, \max(|z - f(x_0)|, |g_0(x)|)$

for every $x \in \overline{B}(x_0, t)$, by the definition (4.138) of $h_z(x)$. Observe that

(4.142) $|g_0(x)| \le \max_{j \ge 2} (|a_j|\, r^{j-2})\, t^2 < t\, |f'(x_0)|$

for every $x \in \overline{B}(x_0, t)$, using (4.111) in the first step, and (4.131) in the second
step. This implies that

(4.143) $|h_z(x) - x_0| \le t$

for every $x \in \overline{B}(x_0, t)$, by (4.137) and (4.141). Equivalently,

(4.144) $h_z(\overline{B}(x_0, t)) \subseteq \overline{B}(x_0, t)$.

If $k$ is complete with respect to the metric associated to $|\cdot|$, then $\overline{B}(x_0, t)$ is
also complete as a metric space with respect to the restriction of this metric
to $\overline{B}(x_0, t)$, because $\overline{B}(x_0, t)$ is a closed set in $k$. This permits us to apply the
contraction mapping theorem to $h_z$ on $\overline{B}(x_0, t)$, to get that $h_z$ has a fixed point
in $\overline{B}(x_0, t)$. This is the same as saying that $f(x) = z$ for some $x \in \overline{B}(x_0, t)$, by
(4.139), which implies (4.136).
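The fixed-point argument can be tried out concretely. The sketch below iterates the mapping $x \mapsto x - f'(x_0)^{-1} (f(x) - z)$, which is another way of writing $h_z$ by (4.139), for a polynomial with integer coefficients, working modulo $p^N$ as a finite stand-in for the $p$-adic integers. It assumes the simplest case $r = 1$, $t = 1/p$, and $|f'(x_0)| = 1$, so that (4.131) holds automatically; the function name `hensel_fixed_point` and the example (a square root of 2 in the 7-adic integers) are illustrative choices, not taken from the text.

```python
def hensel_fixed_point(coeffs, x0, z, p, N):
    """Iterate h_z(x) = x - f'(x0)^(-1) (f(x) - z) modulo p^N.

    coeffs[j] is the integer coefficient a_j of f.  Assumes that f'(x0) is
    a unit mod p and that f(x0) = z mod p, i.e. |f(x0) - z| <= 1/p.
    """
    mod = p**N

    def f(x):
        # Horner evaluation of the polynomial modulo p^N
        acc = 0
        for a in reversed(coeffs):
            acc = (acc * x + a) % mod
        return acc

    dfx0 = sum(j * a * x0 ** (j - 1) for j, a in enumerate(coeffs) if j >= 1)
    if dfx0 % p == 0:
        raise ValueError("this sketch needs |f'(x0)| = 1")
    if (f(x0) - z) % p != 0:
        raise ValueError("z must lie in the closed ball around f(x0) of radius 1/p")

    inv = pow(dfx0, -1, mod)   # modular inverse (Python 3.8+)
    x = x0 % mod
    for _ in range(N):         # each step gains at least one p-adic digit
        x = (x - inv * (f(x) - z)) % mod
    return x


# f(x) = x^2, x0 = 3, z = 2, p = 7: note that 3^2 = 9 = 2 mod 7.
root = hensel_fixed_point([0, 0, 1], x0=3, z=2, p=7, N=10)
print(root, (root * root - 2) % 7**10)
```

The Lipschitz constant (4.140) is at most $1/p$ in this setting, which is why $N$ iterations are enough to stabilize the result modulo $p^N$.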

4.11 Some variants 

Let us go back to the same notation and hypotheses as at the beginning of
Section 4.9. Suppose for the moment that $x, y \in k$ satisfy $|x|, |y| \le r$ and

(4.145) $\max_{j \ge 2} (|a_j|\, r^{j-2})\, |x - y| \le |f'(y)|$,

instead of (4.125). This implies that

(4.146) $|f'(x) - f'(y)| \le |f'(y)|$,

by (4.105) in Section 4.8, and hence

(4.147) $|f'(x)| \le |f'(y)|$,

by the ultrametric version of the triangle inequality. Similarly, (4.145) implies
that

(4.148) $|f(x) - f(y)| \le |f'(y)|\, |x - y|$,

by (4.103) in Section 4.8.

Let $x_0 \in k$ be given, with $|x_0| \le r$, as in Section 4.9. Also let $t_0 > 0$ be
given, with $t_0 \le r$, so that

(4.149) $\overline{B}(x_0, t_0) \subseteq \overline{B}(x_0, r) = \overline{B}(0, r)$,

as before. Let us suppose from now on in this section that

(4.150) $t_0\, \max_{j \ge 2} (|a_j|\, r^{j-2}) \le |f'(x_0)|$,

instead of (4.131). Of course, if $f'(x_0) = 0$, then (4.150) implies that $a_j = 0$ for
every $j \ge 2$, which implies in turn that $a_1 = f'(x_0) = 0$ too. Thus we also ask
that

(4.151) $f'(x_0) \ne 0$.

If $x \in \overline{B}(x_0, t_0)$, then (4.150) implies that $x$ satisfies (4.145) with $y = x_0$, so
that

(4.152) $|f'(x)| \le |f'(x_0)|$,

by (4.147). If $x, y \in \overline{B}(x_0, t_0)$, then $|x - y| \le t_0$ too, and we get that

(4.153) $|f(x) - f(y)| \le |f'(x_0)|\, |x - y|$,

by (4.103) in Section 4.8, (4.150), and (4.152) applied to $y$ instead of $x$.

However, if $x$ is in the open ball $B(x_0, t_0)$, so that $|x - x_0| < t_0$, then (4.150)
implies that $x$ satisfies (4.125) in the previous section with $y = x_0$. It follows that
(4.132) holds for every $x$ in $B(x_0, t_0)$, as before. Similarly, if $x, y \in B(x_0, t_0)$, then
(4.134) holds, for the same reasons as before. If $0 < t < t_0$, then $t \le r$ in particular,
as in Section 4.9, and $t$ also satisfies (4.131) in the previous section, because of
(4.150). This implies that (4.136) holds for every $t \in (0, t_0)$ when $k$ is complete, as
before, and hence that

(4.154) $f(B(x_0, t_0)) = B(f(x_0), |f'(x_0)|\, t_0)$.

Now let $r_1 > 0$ be given, and suppose that our sequence $a_0, a_1, a_2, a_3, \ldots$ of
coefficients has the property that

(4.155) $|a_j|\, r_1^j$

is bounded. This implies that the convergence condition (4.69) in Section 4.5
holds for every $r > 0$ with $r < r_1$. Thus $f(x)$ is defined by the power series
(4.67) in Section 4.5 for every $x \in k$ with $|x| < r_1$, and similarly $f'(x)$ is defined
by the power series (4.76) in Section 4.6 for every $x \in k$ with $|x| < r_1$. Also let
$x_0 \in k$ be given, with $|x_0| < r_1$, and suppose in addition that

(4.156) $r_1\, \max_{j \ge 2} (|a_j|\, r_1^{j-2}) = \max_{j \ge 2} (|a_j|\, r_1^{j-1}) \le |f'(x_0)|$.

As before, we ask that $f'(x_0) \ne 0$ too, since otherwise $a_j = 0$ for every $j \ge 1$.
Under these conditions, we can apply the discussion in the previous section to
$r = t > 0$ such that $r < r_1$ and $|x_0| \le r$, in which case (4.131) follows from
(4.156). This implies that (4.132) and (4.134) hold for every $x, y$ in

(4.157) $\overline{B}(x_0, t) = \overline{B}(x_0, r) = \overline{B}(0, r)$.

In particular, (4.132) holds when $x = 0$, which means that we could have taken
$x_0 = 0$. It follows that (4.132) and (4.134) hold for every $x, y$ in the open ball
$B(0, r_1)$, by taking $r$ close to $r_1$. If $k$ is complete, then

(4.158) $f(B(x_0, r_1)) = B(f(x_0), |f'(x_0)|\, r_1)$,

by (4.136) with $t = r$ close to $r_1$ again.





4.12 Some variants, continued 

Let us return to the same notation and hypotheses as at the beginning of Section
4.9 again. In particular, the sequence of coefficients $a_0, a_1, a_2, a_3, \ldots$ is supposed
to satisfy the convergence condition (4.69) in Section 4.5 for the given $r > 0$,
which means that the analogous condition holds for any smaller value of $r$
too. Of course, the conditions (4.131) and (4.150) in Sections 4.10 and 4.11,
respectively, are more easily satisfied with smaller values of $r$. However, one is
also supposed to have $|x_0| \le r$, and either $t \le r$, as in Sections 4.9 and 4.10, or
$t_0 \le r$, as in Section 4.11.

The condition that $|x_0| \le r$ is trivial when $x_0 = 0$, and one can basically
reduce to that case by expanding $f(x)$ into a power series centered at $x_0$, as in
(4.119) in Section 4.9. Thus one can get the same conclusions as in Section 4.10
when $0 < t \le r$ and

(4.159) $t\, \max_{l \ge 2} (|\tilde{a}_l|\, r^{l-2}) < |f'(x_0)|$,

where $\tilde{a}_l$ is as in (4.117) in Section 4.9. More precisely, (4.159) replaces (4.131)
in Section 4.10, and it is easy to see that (4.131) implies (4.159), because of
(4.118) in Section 4.9. If $0 < t \le r$, then one might as well replace $r$ with $t$, as
in the previous paragraph, so that (4.159) becomes

(4.160) $t\, \max_{l \ge 2} (|\tilde{a}_l|\, t^{l-2}) = \max_{l \ge 2} (|\tilde{a}_l|\, t^{l-1}) < |f'(x_0)|$.

Similarly, one can get the same conclusions as in the preceding section when
$0 < t_0 \le r$, $f'(x_0) \ne 0$, and

(4.161) $t_0\, \max_{l \ge 2} (|\tilde{a}_l|\, r^{l-2}) \le |f'(x_0)|$,

instead of (4.150). As in the previous paragraph, (4.150) implies (4.161), because
of (4.118) in Section 4.9. If $0 < t_0 \le r$ and $f'(x_0) \ne 0$, then one might as well
replace $r$ with $t_0$, as before, so that (4.161) becomes

(4.162) $t_0\, \max_{l \ge 2} (|\tilde{a}_l|\, t_0^{l-2}) = \max_{l \ge 2} (|\tilde{a}_l|\, t_0^{l-1}) \le |f'(x_0)|$.

4.13 A basic situation 

Let $k$ be a field, and let $|\cdot|$ be an ultrametric absolute value function on $k$. Also
let $a_0, a_1, a_2, a_3, \ldots$ be a sequence of elements of $k$, and suppose that

(4.163) $|a_j| \le 1$

for each $j \ge 0$, and that

(4.164) $\lim_{j \to \infty} |a_j| = 0$.

As before, we would like to put

(4.165) $f(x) = \sum_{j=0}^\infty a_j\, x^j$

and

(4.166) $f'(x) = \sum_{j=1}^\infty j \cdot a_j\, x^{j-1}$

for each $x \in k$ with $|x| \le 1$. Of course, these series reduce to finite sums when
$a_j = 0$ for all but finitely many $j$, and otherwise these series converge in $k$
when $k$ is complete with respect to the ultrametric associated to $|\cdot|$, because
of (4.164). If $|\cdot|$ is not the trivial absolute value function on $k$, then (4.166) is
indeed the derivative of (4.165), as in Section 4.6.

Under these conditions, we have that

(4.167) $|f(x)| \le \max_{j \ge 0} (|a_j|\, |x|^j) \le 1$

for every $x \in k$ with $|x| \le 1$, by the ultrametric version of the triangle inequality.
Similarly,

(4.168) $|f'(x)| \le \max_{j \ge 1} (|j \cdot a_j|\, |x|^{j-1}) \le 1$

for every $x \in k$ with $|x| \le 1$, which also corresponds to (4.104) in Section 4.8,
with $r = 1$. Note that $f$ is Lipschitz of order 1 with constant equal to 1 as a
mapping from the closed unit ball $\overline{B}(0, 1)$ in $k$ into $k$, by (4.70) in Section 4.5,
with $r = 1$ again.

Let $x_0 \in k$ be given, with $|x_0| \le 1$. Suppose for the moment that

(4.169) $\max_{j \ge 2} |a_j| < |f'(x_0)|$,

which is the same as saying that (4.131) in Section 4.10 holds with $r = t = 1$.
This implies that (4.132) and (4.134) hold for every $x, y$ in

(4.170) $\overline{B}(x_0, 1) = \overline{B}(0, 1)$,

as before. In particular, (4.132) holds with $x = 0$, so that one could have taken
$x_0 = 0$ here. If $k$ is complete, then

(4.171) $f(\overline{B}(x_0, 1)) = \overline{B}(f(x_0), |f'(x_0)|)$,

by (4.136).

Suppose now that $f'(x_0) \ne 0$, and that

(4.172) $|f'(x_0)| \le \max_{j \ge 2} |a_j|$.

Let $t_0$ be a positive real number such that

(4.173) $t_0\, \max_{j \ge 2} |a_j| \le |f'(x_0)|$,

which implies that $t_0 \le 1$, by (4.172). Note that

(4.174) $t_0 = |f'(x_0)|$

satisfies (4.173), because of (4.163). Of course, (4.173) is the same as (4.150),
with $r = 1$. This implies that (4.132) and (4.134) hold for every $x, y$ in the open
ball $B(x_0, t_0)$, as in Section 4.11. If $k$ is complete, then (4.154) holds too, as before.
As in the previous section, one can also consider analogous arguments for the
expansion of $f$ as a power series around $x_0$.
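To see (4.134) at work in this basic situation, here is a small numerical sketch for the 3-adic absolute value on the integers. The polynomial $f(x) = 3x + x^3$, the center $x_0 = 1$, and the helper names are illustrative assumptions. In this example $|f'(x_0)| = |6| = 1/3$ and $\max_{j \ge 2} |a_j| = 1$, so (4.172) holds and $t_0 = 1/3$ as in (4.174); the open ball of radius $t_0$ around $x_0$ then consists of the integers congruent to 1 modulo 9.

```python
from fractions import Fraction
from itertools import combinations


def padic_abs(n, p):
    """p-adic absolute value of an integer n, with |0|_p = 0."""
    if n == 0:
        return Fraction(0)
    v = 0
    while n % p == 0:
        n //= p
        v += 1
    return Fraction(1, p**v)


def f(x):
    return 3 * x + x**3          # illustrative polynomial, coefficients in Z


def df(x):
    return 3 + 3 * x**2          # its formal derivative


p, x0 = 3, 1
t0 = padic_abs(df(x0), p)        # (4.174): t0 = |f'(x0)| = 1/3

# Integer points of the open ball B(x0, t0) inside a finite window.
ball = [x for x in range(-200, 201) if padic_abs(x - x0, p) < t0]

# (4.134): |f(x) - f(y)| = |f'(x0)| |x - y| for all x, y in the ball.
scaled_isometry = all(
    padic_abs(f(x) - f(y), p) == t0 * padic_abs(x - y, p)
    for x, y in combinations(ball, 2)
)
print(t0, scaled_isometry)
```

Only finitely many integer points are tested, so this is a plausibility check rather than a verification of the general statement.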





4.14 Some examples 

Let $k$ be a field with an ultrametric absolute value function $|\cdot|$, as before. Also
let $n$ be an integer with $n \ge 2$, and put

(4.175) $f(x) = x^n$

for each $x \in k$ with $|x| \le 1$. Thus $f$ is Lipschitz of order 1 with constant equal
to 1 as a mapping from the closed unit ball $\overline{B}(0, 1)$ in $k$ into $k$, as in Section 4.5,
and as mentioned in the preceding section. In this case, it is easy to see that
$f$ cannot be Lipschitz of order 1 with constant strictly less than 1 on $\overline{B}(0, 1)$,
because $f(0) = 0$ and $f(1) = 1$.

Put

(4.176) $f'(x) = n \cdot x^{n-1}$

for each $x \in k$ with $|x| \le 1$, which corresponds to the formal derivative of $f$,
and which is the derivative of $f$ as a function on $\overline{B}(0, 1)$ when $|\cdot|$ is nontrivial,
as usual. It follows that

(4.177) $|f'(x)| = |n \cdot 1|\, |x|^{n-1}$,

where $n \cdot 1$ is the sum of $n$ 1's in $k$. Suppose for the moment that

(4.178) $|n \cdot 1| = 1$,

and let $x_0 \in k$ be given, with $|x_0| = 1$, so that

(4.179) $|f'(x_0)| = 1$.

In this case, we have equality in (4.172), and we can take $t_0 = 1$, as in (4.174).
As before, the restriction of $f$ to the open ball $B(x_0, 1)$ is an isometry, and $f$ maps
$B(x_0, 1)$ onto $B(f(x_0), 1)$ when $k$ is complete.

More precisely, if $k$ has characteristic 0, then there is a natural embedding of
$\mathbf{Q}$ into $k$, so that $|\cdot|$ induces an ultrametric absolute value function on $\mathbf{Q}$. If the
induced absolute value function on $\mathbf{Q}$ is trivial, then (4.178) holds automatically.
Otherwise, Ostrowski's theorem implies that the induced absolute value function
on $\mathbf{Q}$ is equivalent to the $p$-adic absolute value function on $\mathbf{Q}$ for some prime
number $p$, as in Section 1.8. In this case, (4.178) holds when $n$ is not divisible
by $p$. Similarly, if $k$ has characteristic $p$ for some prime number $p$, then (4.178)
holds exactly when $n$ is not divisible by $p$.

If $x, y \in k$ and $|x| \ne |y|$, then

(4.180) $|x - y| = \max(|x|, |y|)$,

as in (1.43) in Section 1.3. In this case, $|x^n| = |x|^n \ne |y|^n = |y^n|$ for each
positive integer $n$, so that

(4.181) $|x^n - y^n| = \max(|x|^n, |y|^n)$,

for the same reasons as in (4.180). Note that this is consistent with the fact that
(4.175) is Lipschitz of order 1 with constant equal to 1 on $\overline{B}(0, 1)$, as before.
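The identities (4.180) and (4.181) are easy to test for the $p$-adic absolute value on the integers. The following sketch uses hypothetical helper names and an arbitrary finite range of test points.

```python
from fractions import Fraction


def padic_abs(n, p):
    """p-adic absolute value of an integer n, with |0|_p = 0."""
    if n == 0:
        return Fraction(0)
    v = 0
    while n % p == 0:
        n //= p
        v += 1
    return Fraction(1, p**v)


def check_4_180_and_4_181(x, y, n, p):
    """For |x| != |y|, test |x - y| = max(|x|, |y|) and |x^n - y^n| = max(|x|^n, |y|^n)."""
    ax, ay = padic_abs(x, p), padic_abs(y, p)
    if ax == ay:
        return True   # the identities are only claimed when |x| != |y|
    return (
        padic_abs(x - y, p) == max(ax, ay)
        and padic_abs(x**n - y**n, p) == max(ax, ay) ** n
    )


identities_hold = all(
    check_4_180_and_4_181(x, y, n, p)
    for p in (2, 3, 5)
    for n in range(2, 6)
    for x in range(0, 31)
    for y in range(0, 31)
)
print(identities_hold)
```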





4.15 Some examples, continued 

Let $k$ be a field with an ultrametric absolute value function $|\cdot|$ again, and
consider $f(x) = x^n$, as in (4.175). Suppose now that $n = p$ for some prime
number $p$, and that (4.178) does not hold. The case where $k$ has characteristic
$p$ was discussed in Section 4.7, and so we suppose that $k$ has characteristic 0.
Thus Ostrowski's theorem implies that the induced absolute value function on
$\mathbf{Q}$ is equivalent to the $p$-adic absolute value function, as before. Let us ask
that the induced absolute value function on $\mathbf{Q}$ actually be equal to the $p$-adic
absolute value function, which can be arranged by replacing the given absolute
value function on $k$ by an appropriate positive power of itself.

Let $x_0 \in k$ be given again, with $|x_0| = 1$, so that

(4.182) $|f'(x_0)| = 1/p$.

The right side of (4.172) in Section 4.13 is equal to 1 in this situation, which
implies that (4.172) holds with strict inequality. Similarly, the maximal value
of $t_0 > 0$ that satisfies (4.173) is given by (4.174). Let us now consider the
analogous arguments for the expansion of $f$ around $x_0$, as in Section 4.12.
Using the binomial theorem, we get that

(4.183) $f(x) = (x_0 + (x - x_0))^p = \sum_{l=0}^p \binom{p}{l} \cdot x_0^{p-l}\, (x - x_0)^l$,

which corresponds to (4.116) in Section 4.9. Thus the coefficients $\tilde{a}_l$ in (4.117)
reduce to

(4.184) $\tilde{a}_l = \binom{p}{l} \cdot x_0^{p-l}$

when $l \le p$, and $\tilde{a}_l = 0$ when $l > p$. This implies that

(4.185) $|\tilde{a}_l| = 1/p$

for $l = 1, \ldots, p - 1$, while $|\tilde{a}_0| = |\tilde{a}_p| = 1$. As in Section 4.12, we would like to
choose $t_0 > 0$ as large as possible so that $t_0 \le 1$ and (4.162) holds. If $p = 2$,
then (4.162) reduces to

(4.186) $t_0 \le |f'(x_0)| = 1/2$,

so that the maximal value of $t_0$ is $1/2$.

Otherwise, if $p > 2$, then

(4.187) $\max_{l \ge 2} (|\tilde{a}_l|\, t_0^{l-1}) = \max(t_0/p,\ t_0^{p-1})$

for each $0 < t_0 \le 1$, which is to say that the maximum on the left side occurs
with either $l = 2$ or $l = p$. In this case, (4.162) reduces to

(4.188) $\max(t_0/p,\ t_0^{p-1}) \le |f'(x_0)| = 1/p$.

Of course, $t_0/p \le 1/p$ automatically when $t_0 \le 1$, and so we can take

(4.189) $t_0 = p^{-1/(p-1)}$

in the context of Section 4.12.

We can also use the binomial theorem to look at the behavior of $f$ more
directly. Let $x, y \in k$ be given, with $|x| = |y|$, since otherwise we already have
(4.180) and (4.181). Put

(4.190) $R = |x| = |y|$,

and note that

(4.191) $|x - y| \le R$,

by the ultrametric version of the triangle inequality. Using the binomial theorem
again, we have that

(4.192) $f(x) - f(y) = x^p - y^p = (y + (x - y))^p - y^p$
        $= \sum_{l=0}^p \binom{p}{l} \cdot y^{p-l}\, (x - y)^l - y^p$
        $= \sum_{l=1}^p \binom{p}{l} \cdot y^{p-l}\, (x - y)^l$.

Observe that

(4.193) $\Big| \binom{p}{l} \cdot y^{p-l}\, (x - y)^l \Big|$

is equal to

(4.194) $(1/p)\, R^{p-l}\, |x - y|^l$

when $1 \le l \le p - 1$, and to

(4.195) $|x - y|^p$

when $l = p$. In particular, (4.193) is equal to

(4.196) $(1/p)\, R^{p-1}\, |x - y|$

when $l = 1$, and (4.193) is less than or equal to (4.196) when $2 \le l \le p - 1$,
because of (4.191). Similarly, if $|x - y| < R$ and $x \ne y$, then (4.193) is strictly
less than (4.196) when $2 \le l \le p - 1$.

Suppose for the moment that

(4.197) $|x - y| < p^{-1/(p-1)}\, R$,

so that

(4.198) $|x - y|^{p-1} < (1/p)\, R^{p-1}$.

If $x \ne y$, then this implies that

(4.199) $|x - y|^p < (1/p)\, R^{p-1}\, |x - y|$.

Note that (4.197) implies that $|x - y| < R$, so that (4.193) is strictly less than
(4.196) when $2 \le l \le p - 1$ and $x \ne y$, as in the preceding paragraph. It follows
that

(4.200) $|f(x) - f(y)| = (1/p)\, R^{p-1}\, |x - y|$

when $x, y \in k$ satisfy (4.190), (4.197), and $x \ne y$, and of course (4.200) holds
trivially when $x = y$. More precisely, if $x \ne y$ satisfy (4.190) and (4.197), then
the absolute values of the terms on the right side of (4.192) corresponding to
$l \ge 2$ are strictly less than (4.196), which is the absolute value of the $l = 1$ term
on the right side of (4.192). This implies that the absolute value of the left side
of (4.192) is equal to the absolute value of the $l = 1$ term on the right side of
(4.192), which is the same as (4.200). This uses the same type of argument as
in (1.43) in Section 1.3. The same conclusion follows from the earlier discussion
when $R = 1$, and it is easy to reduce to that case in this situation anyway, or
to deal with it in the same way as before.

Similarly, if

(4.201) $|x - y| = p^{-1/(p-1)}\, R$,

then

(4.202) $|x - y|^{p-1} = (1/p)\, R^{p-1}$,

and hence

(4.203) $|x - y|^p = (1/p)\, R^{p-1}\, |x - y|$.

This implies that

(4.204) $|f(x) - f(y)| \le (1/p)\, R^{p-1}\, |x - y|$,

because (4.193) is less than or equal to (4.196) when $2 \le l \le p - 1$, as before.
More precisely, all of the terms on the right side of (4.192) have absolute value
less than or equal to (4.196) under these conditions, which implies (4.204). As
in the previous paragraph, one could get the same conclusion from the earlier
discussion when $R = 1$, and the restriction to $R = 1$ is not very serious anyway.

Now suppose that

(4.205) $|x - y| > p^{-1/(p-1)}\, R$,

so that

(4.206) $|x - y|^{p-1} > (1/p)\, R^{p-1}$,

and thus

(4.207) $|x - y|^p > (1/p)\, R^{p-1}\, |x - y|$.

This means that (4.193) is strictly less than (4.195) when $1 \le l \le p - 1$, while
(4.193) is equal to (4.195) when $l = p$. It follows that

(4.208) $|f(x) - f(y)| = |x - y|^p$

in this case, using (4.192) and the same type of argument as in (1.43) in Section
1.3 again.
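The three cases (4.200), (4.204), and (4.208) can be compared numerically for the $p$-adic absolute value on the integers. In the sketch below, the position of $|x - y|$ relative to $p^{-1/(p-1)} R$ is decided exactly by comparing $|x - y|^{p-1}$ with $(1/p) R^{p-1}$, as in (4.198), (4.202), and (4.206). The function names and case labels are illustrative assumptions. Note that the critical case (4.201) can only occur for integers when $p = 2$, because the values of the $p$-adic absolute value on nonzero integers are integer powers of $p$.

```python
from fractions import Fraction


def padic_abs(n, p):
    """p-adic absolute value of an integer n, with |0|_p = 0."""
    if n == 0:
        return Fraction(0)
    v = 0
    while n % p == 0:
        n //= p
        v += 1
    return Fraction(1, p**v)


def power_map_case(x, y, p):
    """Classify (x, y) with |x| = |y| = R and test the matching estimate for x^p - y^p."""
    R = padic_abs(x, p)
    if R != padic_abs(y, p):
        raise ValueError("this sketch assumes |x| = |y|")
    d = padic_abs(x - y, p)
    diff = padic_abs(x**p - y**p, p)
    linear = Fraction(1, p) * R ** (p - 1) * d        # the quantity (4.196)
    lhs, rhs = d ** (p - 1), Fraction(1, p) * R ** (p - 1)
    if lhs < rhs:
        return "small", diff == linear                # (4.200)
    if lhs > rhs:
        return "large", diff == d**p                  # (4.208)
    return "critical", diff <= linear                 # (4.204)


print(power_map_case(1, 4, 3))   # |x - y| = 1/3, below the threshold 3^(-1/2)
print(power_map_case(1, 2, 3))   # |x - y| = 1, above the threshold
print(power_map_case(1, 3, 2))   # |x - y| = 1/2, exactly the threshold for p = 2
```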



Chapter 5 


Some additional topics 


5.1 Sums of functions 

Let $k$ be a field with an absolute value function $|\cdot|$, and suppose that $k$ is
complete with respect to the metric associated to $|\cdot|$. Also let $E$ and $M$ be
nonempty sets, and let $a_\alpha(x)$ be a $k$-valued function on $M$ for each $\alpha \in E$. Let
us say that

(5.1) $\sum_{\alpha \in E} a_\alpha$

converges pointwise on $M$ if for each $x \in M$,

(5.2) $\sum_{\alpha \in E} a_\alpha(x)$

converges as a sum in $k$ in the sense discussed in Section 2.6. By definition, this
means that the corresponding net of finite sums

(5.3) $\sum_{\alpha \in A} a_\alpha(x)$

converges in $k$ for each $x \in M$, where $A$ is a finite subset of $E$. Remember that
this happens exactly when (5.2) satisfies the generalized Cauchy criterion as a
sum in $k$ for each $x \in M$, as in Sections 2.6 and 2.7, because $k$ is complete.

Let $\ell^\infty(M, k)$ be the vector space of bounded $k$-valued functions on $M$,
equipped with the supremum norm, as in Section 2.2. If $a_\alpha$ is bounded as a
$k$-valued function on $M$ for each $\alpha \in E$, then (5.1) may be considered as a
sum in $\ell^\infty(M, k)$, as in Section 2.6. Thus (5.1) converges in $\ell^\infty(M, k)$ if the
corresponding net of finite sums

(5.4) $\sum_{\alpha \in A} a_\alpha$

converges in $\ell^\infty(M, k)$ with respect to the supremum norm, where $A$ is a finite
subset of $E$ again. In this case, this means that the finite sums (5.3) converge
uniformly on $M$. In particular, this implies that the finite sums (5.3) converge
pointwise on $M$. Remember that $\ell^\infty(M, k)$ is complete with respect to the
metric associated to the supremum norm, because $k$ is complete, by hypothesis.
It follows that (5.1) converges in $\ell^\infty(M, k)$ with respect to the supremum norm
if and only if (5.1) satisfies the generalized Cauchy criterion with respect to the
supremum norm, as in Sections 2.6 and 2.7.

Let us continue to ask that $a_\alpha$ be a bounded $k$-valued function on $M$ for
each $\alpha \in E$, and let $a$ denote the mapping from $E$ into $\ell^\infty(M, k)$ defined by

(5.5) $\alpha \mapsto a_\alpha$.

Similarly, for each $x \in M$, let $a(x)$ denote the mapping from $E$ into $k$ defined
by

(5.6) $\alpha \mapsto a_\alpha(x)$.

Suppose that $a$ has bounded finite sums on $E$ with respect to the supremum
norm on $\ell^\infty(M, k)$, as in Section 2.8. This implies that $a(x)$ has bounded finite
sums on $E$ for each $x \in M$. More precisely, for each $x \in M$, we have that

(5.7) $\|a(x)\|_{BFS(E, k)} \le \|a\|_{BFS(E, \ell^\infty(M, k))}$,

where the BFS norms are as defined in Section 2.8. Conversely, if $a(x)$ has
bounded finite sums on $E$ for each $x \in M$, and if the BFS norm of $a(x)$ on $E$
is uniformly bounded over $x \in M$, then $a$ has bounded finite sums on $E$ with
respect to the supremum norm on $\ell^\infty(M, k)$. In this situation, it is easy to see
that

(5.8) $\|a\|_{BFS(E, \ell^\infty(M, k))} = \sup_{x \in M} \|a(x)\|_{BFS(E, k)}$.

If (5.1) converges in $\ell^\infty(M, k)$, then $a$ has bounded finite sums on $E$, as in
Section 2.8. Similarly, if (5.2) converges in $k$ for some $x \in M$, then $a(x)$ has
bounded finite sums on $E$, and

(5.9) $\Big| \sum_{\alpha \in E} a_\alpha(x) \Big| \le \|a(x)\|_{BFS(E, k)}$.

If $a$ has bounded finite sums on $E$, and if (5.2) converges in $k$ for every $x \in M$,
then (5.2) defines a bounded $k$-valued function on $M$, with

(5.10) $\Big| \sum_{\alpha \in E} a_\alpha(x) \Big| \le \|a\|_{BFS(E, \ell^\infty(M, k))}$

for each $x \in M$, by (5.7) and (5.9).

Suppose for the moment that $k = \mathbf{R}$, with the standard absolute value
function. If $a(x)$ has bounded finite sums on $E$ for some $x \in M$, then $a(x)$ is
summable on $E$, as mentioned in Section 2.8. More precisely, we have that

(5.11) $\sum_{\alpha \in E} |a_\alpha(x)| \le 2\, \|a(x)\|_{BFS(E, \mathbf{R})}$,

as in (2.85) in Section 2.8. In particular, this implies that (5.2) converges in $\mathbf{R}$,
as in Section 2.7. If $a$ has bounded finite sums on $E$, then $a(x)$ has bounded
finite sums on $E$ for each $x \in M$, as before. This implies that

(5.12) $\sum_{\alpha \in E} |a_\alpha(x)| \le 2\, \|a\|_{BFS(E, \ell^\infty(M, \mathbf{R}))}$

for every $x \in M$, by combining (5.7) and (5.11). Thus (5.2) converges in $\mathbf{R}$ for
each $x \in M$ under these conditions, and defines a bounded real-valued function
on $M$, as in the preceding paragraph.
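The factor 2 in (5.11) can be seen on a finite index set. The sketch below assumes that the BFS norm of a real-valued function on $E$ is the supremum of $|\sum_{\alpha \in A} a_\alpha|$ over finite subsets $A \subseteq E$; this is an assumption about the definition in Section 2.8, which is not reproduced here. It computes that quantity by brute force. The function name `bfs_norm` and the sample values are hypothetical.

```python
from fractions import Fraction
from itertools import combinations


def bfs_norm(values):
    """Largest |sum over A| among nonempty subsets A of a finite list of reals.

    Assumed definition of the BFS norm for a finite index set; brute force,
    so only suitable for short lists.
    """
    best = Fraction(0)
    for size in range(1, len(values) + 1):
        for subset in combinations(values, size):
            best = max(best, abs(sum(subset)))
    return best


values = [Fraction(3, 2), Fraction(-2), Fraction(1, 3), Fraction(-5, 4), Fraction(4)]
total_abs = sum(abs(v) for v in values)
norm = bfs_norm(values)
print(norm, total_abs, total_abs <= 2 * norm)
```

For real numbers the maximum is attained by taking either all positive terms or all negative terms, which is where the factor 2 comes from: the sum of the absolute values is the sum of these two partial sums in absolute value.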

Similarly, if $k = \mathbf{C}$ with the standard absolute value function, then we can
apply the remarks in the previous paragraph to the real and imaginary parts of
$a$. If $a(x)$ has bounded finite sums on $E$ for some $x \in M$, then it follows that
$a(x)$ is summable on $E$, with

(5.13) $\sum_{\alpha \in E} |a_\alpha(x)| \le 4\, \|a(x)\|_{BFS(E, \mathbf{C})}$,

by applying (5.11) to the real and imaginary parts of $a(x)$. If $a$ has bounded
finite sums on $E$, then $a(x)$ has bounded finite sums on $E$ for every $x \in M$, and
we get that

(5.14) $\sum_{\alpha \in E} |a_\alpha(x)| \le 4\, \|a\|_{BFS(E, \ell^\infty(M, \mathbf{C}))}$

for every $x \in M$, by combining (5.7) and (5.13). This implies that (5.2) converges
in $\mathbf{C}$ for each $x \in M$, and defines a bounded complex-valued function on
$M$, as before.

Suppose now that $k$ is a field with an ultrametric absolute value function $|\cdot|$,
and that $k$ is still complete with respect to the associated ultrametric. In this
case, the supremum norm on $\ell^\infty(M, k)$ is an ultranorm, as in Section 2.2. This
implies that the BFS norm of a function on $E$ with values in $k$ or $\ell^\infty(M, k)$ is the
same as the corresponding supremum norm on $E$, as in Section 2.8. Similarly,
the sum over $E$ of a function on $E$ with values in $k$ or in $\ell^\infty(M, k)$ satisfies the
generalized Cauchy criterion if and only if the function vanishes at infinity on
$E$, as in Sections 2.6 and 2.7.

Let $k$ be any field with an absolute value function $|\cdot|$ again, and where $k$ is
still complete with respect to the associated metric. Suppose that $M$ is now also
equipped with a topology, and let $C_b(M, k)$ be the space of bounded continuous
$k$-valued functions on $M$. Thus $C_b(M, k)$ is a subalgebra of $\ell^\infty(M, k)$ with
respect to pointwise addition and multiplication, and a closed set in $\ell^\infty(M, k)$
with respect to the supremum norm. If $a_\alpha \in C_b(M, k)$ for each $\alpha \in E$, and if
the sum (5.1) converges in $\ell^\infty(M, k)$ with respect to the supremum norm, then
the sum is a continuous function on $M$ too.


5.2 Lipschitz seminorms 

Let $k$ be a field, let $|\cdot|$ be an absolute value function on $k$, and let $V$ be a
vector space over $k$. A nonnegative real-valued function $N$ on $V$ is said to be a
seminorm on $V$ if

(5.15) $N(t\, v) = |t|\, N(v)$

for every $v \in V$ and $t \in k$, and

(5.16) $N(v + w) \le N(v) + N(w)$

for every $v, w \in V$. Note that (5.15) implies that $N(0) = 0$, by taking $t = 0$,
and that a seminorm $N$ on $V$ is a norm on $V$ when $N(v) > 0$ for every $v \in V$
with $v \ne 0$. A seminorm $N$ on $V$ may be called an ultra-seminorm if

(5.17) $N(v + w) \le \max(N(v), N(w))$

for every $v, w \in V$, which automatically implies (5.16).

Let $(M, d(x, y))$ be a (nonempty) metric space, and let $\gamma$ be a positive real
number. Consider the space $\mathrm{Lip}_\gamma(M, k)$ of $k$-valued functions on $M$ that are
Lipschitz of order $\gamma$, as in Section 4.4. More precisely, this uses the metric on $k$
associated to the absolute value function. It is easy to see that $\mathrm{Lip}_\gamma(M, k)$ is a
vector space over $k$ with respect to pointwise addition and scalar multiplication,
as in Section 4.4 again. If $f \in \mathrm{Lip}_\gamma(M, k)$, then put

(5.18) $\|f\|_{\mathrm{Lip}_\gamma(M, k)} = \sup \Big\{ \frac{|f(x) - f(y)|}{d(x, y)^\gamma} : x, y \in M,\ x \ne y \Big\}$

when $M$ has at least two elements, and otherwise put $\|f\|_{\mathrm{Lip}_\gamma(M, k)} = 0$. If $M$
has at least two elements, then the right side of (5.18) is the supremum of a
bounded nonempty set of nonnegative real numbers, and hence is a nonnegative
real number. Of course, if $f$ is Lipschitz of order $\gamma$ on $M$ with constant $C$, then
(5.18) is less than or equal to $C$. It is easy to see that $f \in \mathrm{Lip}_\gamma(M, k)$ is Lipschitz
of order $\gamma$ with constant equal to (5.18), so that (5.18) may be characterized
equivalently as the smallest nonnegative real number $C$ such that $f$ is Lipschitz
of order $\gamma$ with constant $C$. Observe that (5.18) is equal to 0 if and only if $f$ is
constant on $M$. One can check that (5.18) defines a seminorm on $\mathrm{Lip}_\gamma(M, k)$,
as in Section 4.4. If $|\cdot|$ is an ultrametric absolute value function on $k$, then
(5.18) defines an ultra-seminorm on $\mathrm{Lip}_\gamma(M, k)$.

Let

(5.19) $\mathrm{Lip}_{b,\gamma}(M, k) = \mathrm{Lip}_\gamma(M, k) \cap \ell^\infty(M, k)$

be the space of bounded $k$-valued functions on $M$ that are Lipschitz of order
$\gamma$. Of course, this is a linear subspace of both $\mathrm{Lip}_\gamma(M, k)$ and $\ell^\infty(M, k)$, which
are both linear subspaces of the space of all $k$-valued functions on $M$. If $M$ is
bounded, then every $k$-valued Lipschitz function on $M$ of any positive order is
bounded on $M$, so that (5.19) is the same as $\mathrm{Lip}_\gamma(M, k)$.

There are two particularly simple ways in which to define a norm on (5.19).
The first possibility is to put

(5.20) $\|f\|_{\mathrm{Lip}_{b,\gamma}(M, k)} = \|f\|_{\mathrm{Lip}_\gamma(M, k)} + \|f\|_{\ell^\infty(M, k)}$

for every $f \in \mathrm{Lip}_{b,\gamma}(M, k)$, where $\|f\|_{\ell^\infty(M, k)}$ denotes the supremum norm of $f$,
as in Section 2.2. The second possibility is to put

(5.21) $\|f\|_{\mathrm{Lip}_{b,\gamma}(M, k)} = \max(\|f\|_{\mathrm{Lip}_\gamma(M, k)},\ \|f\|_{\ell^\infty(M, k)})$

for every $f \in \mathrm{Lip}_{b,\gamma}(M, k)$. It is easy to see that both (5.20) and (5.21) define
norms on $\mathrm{Lip}_{b,\gamma}(M, k)$, because $\|f\|_{\mathrm{Lip}_\gamma(M, k)}$ is a seminorm on $\mathrm{Lip}_\gamma(M, k)$, and
$\|f\|_{\ell^\infty(M, k)}$ is a norm on $\ell^\infty(M, k)$. Note that (5.21) is less than or equal to
(5.20), and that (5.20) is less than or equal to 2 times (5.21). This implies
that (5.20) and (5.21) determine the same topology on $\mathrm{Lip}_{b,\gamma}(M, k)$. If $|\cdot|$ is
an ultrametric absolute value function on $k$, then (5.21) has the advantage of
being an ultranorm on $\mathrm{Lip}_{b,\gamma}(M, k)$, because $\|f\|_{\mathrm{Lip}_\gamma(M, k)}$ is an ultra-seminorm
on $\mathrm{Lip}_\gamma(M, k)$, and $\|f\|_{\ell^\infty(M, k)}$ is an ultranorm on $\ell^\infty(M, k)$.

If $f$, $g$ are bounded $k$-valued functions on $M$, then their product $f g$ is
bounded on $M$ too, and satisfies

(5.22)    $\|f g\|_{\ell^\infty(M,k)} \le \|f\|_{\ell^\infty(M,k)}\, \|g\|_{\ell^\infty(M,k)}.$

If $f$ and $g$ are also both Lipschitz of order $\gamma$ on $M$, then one can check that $f g$
is Lipschitz of order $\gamma$ on $M$ as well, as in Section 4.4. More precisely, we have
that

(5.23)    $\|f g\|_{\mathrm{Lip}_\gamma(M,k)} \le \|f\|_{\mathrm{Lip}_\gamma(M,k)}\, \|g\|_{\ell^\infty(M,k)} + \|f\|_{\ell^\infty(M,k)}\, \|g\|_{\mathrm{Lip}_\gamma(M,k)},$

as in (4.54) in Section 4.4. If the norm on $\mathrm{Lip}_{b,\gamma}(M, k)$ is defined as in (5.20),
then it follows that

(5.24)    $\|f g\|_{\mathrm{Lip}_{b,\gamma}(M,k)} \le \|f\|_{\mathrm{Lip}_{b,\gamma}(M,k)}\, \|g\|_{\mathrm{Lip}_{b,\gamma}(M,k)}.$

This is an advantage of (5.20) in the archimedian case. If $|\cdot|$ is an ultrametric
absolute value function on $k$, then we have that

(5.25)    $\|f g\|_{\mathrm{Lip}_\gamma(M,k)} \le \max\big(\|f\|_{\mathrm{Lip}_\gamma(M,k)}\, \|g\|_{\ell^\infty(M,k)},\ \|f\|_{\ell^\infty(M,k)}\, \|g\|_{\mathrm{Lip}_\gamma(M,k)}\big),$

as in (4.55) in Section 4.4. In this case, (5.24) still holds when the $\mathrm{Lip}_{b,\gamma}(M, k)$
norm is defined as in (5.21).
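The comparison between (5.20) and (5.21), and the product estimates (5.23) and (5.24), can be tested numerically in the archimedian case. The following is only a sketch under stated assumptions: $M$ is taken to be a small finite set of rational numbers with the usual distance, $\gamma = 1$, and the functions `f`, `g` and the helper names are illustrative choices that do not come from the text.

```python
from fractions import Fraction
from itertools import combinations

# Assumption: M is a finite set of rationals with d(x, y) = |x - y|, and gamma = 1.
M = [Fraction(n, 4) for n in range(-4, 5)]

def sup_norm(f):
    """Supremum norm of f on M."""
    return max(abs(f(x)) for x in M)

def lip_seminorm(f):
    """Smallest C with |f(x) - f(y)| <= C |x - y| for all x, y in M."""
    return max(abs(f(x) - f(y)) / abs(x - y) for x, y in combinations(M, 2))

def norm_sum(f):
    """Norm as in (5.20)."""
    return lip_seminorm(f) + sup_norm(f)

def norm_max(f):
    """Norm as in (5.21)."""
    return max(lip_seminorm(f), sup_norm(f))

def f(x):
    return x * x - Fraction(1, 2)

def g(x):
    return 3 * x + 1

def fg(x):
    return f(x) * g(x)

# (5.21) <= (5.20) <= 2 * (5.21)
assert norm_max(f) <= norm_sum(f) <= 2 * norm_max(f)
# (5.23)
assert lip_seminorm(fg) <= lip_seminorm(f) * sup_norm(g) + sup_norm(f) * lip_seminorm(g)
# (5.24), with the norm defined as in (5.20)
assert norm_sum(fg) <= norm_sum(f) * norm_sum(g)
```

Exact rational arithmetic is used so that the inequalities are checked without rounding error.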

If $k$ is complete with respect to the metric associated to $|\cdot|$, then $\mathrm{Lip}_{b,\gamma}(M, k)$
is complete with respect to the norms (5.20) and (5.21), which are essentially
equivalent for this purpose. To see this, let $\{f_j\}_{j=1}^\infty$ be a Cauchy sequence of
elements of $\mathrm{Lip}_{b,\gamma}(M, k)$ with respect to either of these norms. This basically
means that $\{f_j\}_{j=1}^\infty$ is a Cauchy sequence with respect to both the supremum
norm and the Lipschitz seminorm (5.18). The completeness of $\ell^\infty(M, k)$ implies
that $\{f_j\}_{j=1}^\infty$ converges to a bounded $k$-valued function $f$ on $M$ with respect to
the supremum norm. It is easy to see that $f$ is also Lipschitz of order $\gamma$ on
$M$ under these conditions, because the corresponding Lipschitz seminorms of
the $f_j$'s are bounded. Similarly, one can check that $\{f_j\}_{j=1}^\infty$ converges to $f$
with respect to the Lipschitz seminorm (5.18) under these conditions, using the
Cauchy condition for $\{f_j\}_{j=1}^\infty$ with respect to the Lipschitz seminorm. This
implies that $\{f_j\}_{j=1}^\infty$ converges to $f$ in $\mathrm{Lip}_{b,\gamma}(M, k)$, as desired.

Suppose now that $M$ is a nonempty subset of $k$, equipped with the metric
that is the restriction to $M$ of the metric associated to $|\cdot|$ on $k$, and let us take
$\gamma = 1$. Also let $\{f_j\}_{j=1}^\infty$ be a sequence of elements of $\mathrm{Lip}_1(M, k)$ that converges
to some $f \in \mathrm{Lip}_1(M, k)$ with respect to the $\mathrm{Lip}_1(M, k)$ seminorm, in the sense
that

(5.26)    $\lim_{j \to \infty} \|f_j - f\|_{\mathrm{Lip}_1(M,k)} = 0.$

Let $x$ be an element of $M$ that is a limit point of $M$ too, and suppose that $f_j$
is differentiable at $x$ for each $j \ge 1$, as in Section 4.1. It is easy to see that

(5.27)    $|f_j'(x) - f_l'(x)| \le \|f_j - f_l\|_{\mathrm{Lip}_1(M,k)}$

for every $j, l \ge 1$, since the difference quotients of a $k$-valued function on $M$ are
bounded by the $\mathrm{Lip}_1(M, k)$ seminorm of the function. Note that

(5.28)    $\lim_{j,l \to \infty} \|f_j - f_l\|_{\mathrm{Lip}_1(M,k)} = 0,$

because of (5.26), so that

(5.29)    $\lim_{j,l \to \infty} |f_j'(x) - f_l'(x)| = 0,$

by (5.27). Of course, this says exactly that $\{f_j'(x)\}_{j=1}^\infty$ is a Cauchy sequence in
$k$. If $k$ is complete with respect to the metric associated to $|\cdot|$, then $\{f_j'(x)\}_{j=1}^\infty$
converges to an element of $k$. Under these conditions, one can check that $f$ is
differentiable at $x$ as well, with

(5.30)    $f'(x) = \lim_{j \to \infty} f_j'(x).$

More precisely, the difference quotients for $f_j$ at $x$ converge uniformly to the
corresponding difference quotients for $f$ at $x$, because of (5.26). This permits
one to interchange the order of the limits, by standard arguments.


5.3 The product rule 

Let $k$ be a field, and let

(5.31)    $f(X) = \sum_{j=0}^\infty a_j\, X^j, \quad g(X) = \sum_{j=0}^\infty b_j\, X^j$

be formal power series with coefficients in $k$. The derivatives $f'(X)$, $g'(X)$ of
$f(X)$, $g(X)$ are defined as formal power series by

(5.32)    $f'(X) = \sum_{j=1}^\infty j \cdot a_j\, X^{j-1}, \quad g'(X) = \sum_{j=1}^\infty j \cdot b_j\, X^{j-1},$

as in (4.72) in Section 4.6. The product of $f(X)$ and $g(X)$ is given by

(5.33)    $(f g)(X) = f(X)\, g(X) = \sum_{n=0}^\infty c_n\, X^n,$

where

(5.34)    $c_n = \sum_{j=0}^n a_j\, b_{n-j}$

is the Cauchy product of the coefficients of $f(X)$ and $g(X)$, as in (3.17) and
(3.18) in Section 3.2. Thus

(5.35)    $(f g)'(X) = \sum_{n=1}^\infty n \cdot c_n\, X^{n-1},$

and one can check that

(5.36)    $(f g)'(X) = f'(X)\, g(X) + f(X)\, g'(X),$

as in the usual product rule for derivatives. More precisely, it is easy to see that

(5.37)    $n \cdot c_n = \sum_{j=0}^n n \cdot a_j\, b_{n-j} = \sum_{j=0}^n (j \cdot a_j)\, b_{n-j} + \sum_{j=0}^n a_j\, ((n-j) \cdot b_{n-j})$
          $= \sum_{j=1}^n (j \cdot a_j)\, b_{n-j} + \sum_{j=0}^{n-1} a_j\, ((n-j) \cdot b_{n-j})$

for each $n \ge 1$. The two sums on the right side of (5.37) correspond to the
coefficients of $X^{n-1}$ in the two terms on the right side of (5.36), which are given
by Cauchy products. Of course, these expressions for the Cauchy products are
slightly different from the usual ones, because of the shifts in the indices for the
derivatives.
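The coefficient identity (5.37) behind the formal product rule (5.36) can be checked on truncated power series. This is a minimal sketch, assuming $k = \mathbf{Q}$ and an arbitrary truncation order `N`; the helper names and the particular coefficients are illustrative only.

```python
from fractions import Fraction

N = 8  # number of coefficients kept (an arbitrary choice)

def cauchy_product(a, b):
    """Coefficients c_n = sum_{j=0}^n a_j b_{n-j}, as in (5.34), for n < min(len(a), len(b))."""
    n_terms = min(len(a), len(b))
    return [sum(a[j] * b[n - j] for j in range(n + 1)) for n in range(n_terms)]

def derivative(a):
    """Coefficients of the formal derivative, as in (5.32)."""
    return [j * a[j] for j in range(1, len(a))]

# Illustrative coefficients: a_j = 1/(j+1), b_j = (-1/2)^j.
a = [Fraction(1, j + 1) for j in range(N)]
b = [Fraction((-1) ** j, 2 ** j) for j in range(N)]

c = cauchy_product(a, b)
lhs = derivative(c)                              # coefficients of (f g)'
fprime_g = cauchy_product(derivative(a), b)      # coefficients of f' g
f_gprime = cauchy_product(a, derivative(b))      # coefficients of f g'
rhs = [u + v for u, v in zip(fprime_g, f_gprime)]

assert lhs == rhs  # (5.36), coefficient by coefficient up to X^{N-2}
```

Each coefficient of $X^{n-1}$ with $n \le N - 1$ depends only on $a_0, \ldots, a_n$ and $b_0, \ldots, b_n$, so truncation does not affect the comparison.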

Suppose for the moment that $k = \mathbf{R}$ or $\mathbf{C}$, with the standard absolute value
function, and that

(5.38)    $\sum_{j=0}^\infty |a_j|\, r^j, \quad \sum_{j=0}^\infty |b_j|\, r^j$

converge for some positive real number $r$. Put

(5.39)    $f(x) = \sum_{j=0}^\infty a_j\, x^j, \quad g(x) = \sum_{j=0}^\infty b_j\, x^j$

for every $x \in k$ with $|x| \le r$, where the series in (5.39) converge absolutely by
the comparison test. If $c_n$ is as in (5.34), then it is easy to see that

(5.40)    $|c_n|\, r^n \le \sum_{j=0}^n (|a_j|\, r^j)\, (|b_{n-j}|\, r^{n-j})$

for each $n \ge 0$. The right side of (5.40) corresponds exactly to the Cauchy
product of the series in (5.38), so that

(5.41)    $\sum_{n=0}^\infty |c_n|\, r^n \le \Big(\sum_{j=0}^\infty |a_j|\, r^j\Big) \Big(\sum_{l=0}^\infty |b_l|\, r^l\Big),$

as in Section 3.1. In particular, the left side of (5.41) converges, and we have
that

(5.42)    $f(x)\, g(x) = \sum_{n=0}^\infty c_n\, x^n$

for every $x \in k$ with $|x| \le r$, as in Sections 3.1 and 3.6.
Let us ask that

(5.43)    $\sum_{j=1}^\infty j\, |a_j|\, r^{j-1}, \quad \sum_{j=1}^\infty j\, |b_j|\, r^{j-1}$

converge, which implies the convergence of the series in (5.38). As in Section
4.6, this also implies that $f$ and $g$ are differentiable as functions defined on the
closed ball $\overline{B}(0, r)$ in $k$, with

(5.44)    $f'(x) = \sum_{j=1}^\infty j \cdot a_j\, x^{j-1}, \quad g'(x) = \sum_{j=1}^\infty j \cdot b_j\, x^{j-1}$

for every $x \in k$ with $|x| \le r$. Observe that

(5.45)    $n\, |c_n|\, r^{n-1} \le \sum_{j=1}^n (j\, |a_j|\, r^{j-1})\, (|b_{n-j}|\, r^{n-j}) + \sum_{j=0}^{n-1} (|a_j|\, r^j)\, ((n-j)\, |b_{n-j}|\, r^{n-j-1})$

for each $n \ge 1$, using (5.37) and the triangle inequality. The two sums on
the right side of (5.45) may be considered as Cauchy products, which can be
summed over $n$ to obtain that

(5.46)    $\sum_{n=1}^\infty n\, |c_n|\, r^{n-1} \le \Big(\sum_{j=1}^\infty j\, |a_j|\, r^{j-1}\Big) \Big(\sum_{l=0}^\infty |b_l|\, r^l\Big) + \Big(\sum_{j=0}^\infty |a_j|\, r^j\Big) \Big(\sum_{l=1}^\infty l\, |b_l|\, r^{l-1}\Big).$

The sums on the right side of (5.46) are finite by hypothesis, so that the sum
on the left side of (5.46) is finite too. Thus the discussion in Section 4.6 implies
that $f g$ is differentiable as a function defined on $\overline{B}(0, r)$, with

(5.47)    $(f g)'(x) = \sum_{n=1}^\infty n \cdot c_n\, x^{n-1}$

for every $x \in k$ with $|x| \le r$, and where the series converges absolutely by the
comparison test. This implies that

(5.48)    $(f g)'(x) = f'(x)\, g(x) + f(x)\, g'(x)$

for every $x \in k$ with $|x| \le r$, by treating the products on the right side of (5.48)
as Cauchy products, as in (5.36). Note that the discussion of the product rule in
Section 4.1 also implies that $f g$ is differentiable as a function defined on $\overline{B}(0, r)$,
with derivative as in (5.48).

Now let $k$ be a field with an ultrametric absolute value function $|\cdot|$, and
suppose that $k$ is complete with respect to the ultrametric associated to $|\cdot|$.
Suppose also that

(5.49)    $\lim_{j \to \infty} |a_j|\, r^j = \lim_{j \to \infty} |b_j|\, r^j = 0$

for some positive real number $r$, which implies that the series in (5.39) converge
in $k$ for every $x \in k$ with $|x| \le r$. If $c_n$ is as in (5.34), then we get that

(5.50)    $|c_n|\, r^n \le \max_{0 \le j \le n} (|a_j|\, |b_{n-j}|)\, r^n = \max_{0 \le j \le n} \big((|a_j|\, r^j)\, (|b_{n-j}|\, r^{n-j})\big)$

for each $n \ge 0$. It follows from this and (5.49) that

(5.51)    $\lim_{n \to \infty} |c_n|\, r^n = 0,$

and hence that the series on the right side of (5.42) converges in $k$ for every $x \in k$
with $|x| \le r$. The right side of (5.42) is the same as the Cauchy product of the
series in (5.39), so that (5.42) holds for every $x \in k$ with $|x| \le r$, as in Section
3.1. If $|\cdot|$ is not the trivial absolute value function on $k$, then the discussion in
Section 4.6 implies that $f$, $g$, and $f g$ are differentiable as functions on $\overline{B}(0, r)$
in $k$, with derivatives given by the series in (5.44) and (5.47). In this case, the
convergence of these series for $x \in k$ with $|x| \le r$ follows from (5.49) and (5.51),
as in Section 4.6. As before, one can derive (5.48) from (5.47), by treating the
products on the right side of (5.48) as Cauchy products. One can also use the
discussion of the product rule in Section 4.1 to get that $f g$ is differentiable as
a function defined on $\overline{B}(0, r)$, with derivative as in (5.48), as in the previous
situation.


5.4 The chain rule 

Let $k$ be a field again, and let

(5.52)    $f(X) = \sum_{j=0}^\infty a_j\, X^j, \quad g(Y) = \sum_{l=0}^\infty b_l\, Y^l$

be formal power series with coefficients in $k$. As before, the derivatives $f'(X)$,
$g'(Y)$ of $f(X)$, $g(Y)$ are defined as formal power series by

(5.53)    $f'(X) = \sum_{j=1}^\infty j \cdot a_j\, X^{j-1}, \quad g'(Y) = \sum_{l=1}^\infty l \cdot b_l\, Y^{l-1},$

as in (4.72) in Section 4.6. Suppose for the moment that $a_j = 0$ for all but
finitely many $j$, or that $b_0 = 0$. In both cases, the composition

(5.54)    $(f \circ g)(Y) = f(g(Y)) = \sum_{j=0}^\infty a_j\, g(Y)^j$

of $f(X)$ and $g(Y)$ can be defined as a formal power series with coefficients in $k$
too, as in Section 3.8. Similarly, the composition

(5.55)    $(f' \circ g)(Y) = f'(g(Y)) = \sum_{j=1}^\infty j \cdot a_j\, g(Y)^{j-1}$

of $f'(X)$ with $g(Y)$ can be defined as a formal power series with coefficients in
$k$ as well. Put

(5.56)    $(g^j)(Y) = g(Y)^j$

for each positive integer $j$, and observe that

(5.57)    $(g^j)'(Y) = j \cdot g(Y)^{j-1}\, g'(Y)$

by the product rule, where $(g^j)'(Y)$ is the derivative of $(g^j)(Y)$. Using this, one
can verify that

(5.58)    $(f \circ g)'(Y) = f'(g(Y))\, g'(Y),$

where $(f \circ g)'(Y)$ is the derivative of $(f \circ g)(Y)$. Of course, this is a version of
the chain rule for formal power series.
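In the case where only finitely many coefficients are nonzero, (5.58) is an identity between polynomials and can be tested directly. The following sketch assumes $k = \mathbf{Q}$ and represents a polynomial by its list of coefficients; the helper names and the particular polynomials are illustrative choices.

```python
from fractions import Fraction

def add(p, q):
    """Sum of two polynomials given as coefficient lists."""
    n = max(len(p), len(q))
    p = list(p) + [Fraction(0)] * (n - len(p))
    q = list(q) + [Fraction(0)] * (n - len(q))
    return [u + v for u, v in zip(p, q)]

def mul(p, q):
    """Product of two polynomials (Cauchy product of the coefficients)."""
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, u in enumerate(p):
        for j, v in enumerate(q):
            out[i + j] += u * v
    return out

def derivative(p):
    """Formal derivative, as in (5.53)."""
    return [j * p[j] for j in range(1, len(p))] or [Fraction(0)]

def compose(p, q):
    """Coefficients of p(q(Y)), as in (5.54), computed by Horner's scheme."""
    out = [Fraction(0)]
    for coeff in reversed(p):
        out = add(mul(out, q), [coeff])
    return out

def trim(p):
    """Drop trailing zero coefficients."""
    p = list(p)
    while len(p) > 1 and p[-1] == 0:
        p.pop()
    return p

# Illustrative polynomials: f(X) = 1 - 2X + 3X^3, g(Y) = Y/2 + Y^2.
f = [Fraction(1), Fraction(-2), Fraction(0), Fraction(3)]
g = [Fraction(0), Fraction(1, 2), Fraction(1)]

lhs = trim(derivative(compose(f, g)))                   # (f o g)'
rhs = trim(mul(compose(derivative(f), g), derivative(g)))  # (f' o g) g'
assert lhs == rhs  # (5.58)
```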

Suppose now for the moment that $a_j = 0$ for all but finitely many $j$, and
that $b_l = 0$ for all but finitely many $l$. Thus

(5.59)    $f(x) = \sum_{j=0}^\infty a_j\, x^j, \quad g(y) = \sum_{l=0}^\infty b_l\, y^l$

are defined for all $x, y \in k$, and hence their composition

(5.60)    $(f \circ g)(y) = f(g(y)) = \sum_{j=0}^\infty a_j\, g(y)^j$

is defined for every $y \in k$ too. If $k$ is equipped with a nontrivial absolute value
function $|\cdot|$, then the derivatives of $f$ and $g$ can be defined as in Section 4.1,
and are given by

(5.61)    $f'(x) = \sum_{j=1}^\infty j \cdot a_j\, x^{j-1}, \quad g'(y) = \sum_{l=1}^\infty l \cdot b_l\, y^{l-1}$

for every $x, y \in k$. Similarly, the composition

(5.62)    $(f' \circ g)(y) = f'(g(y)) = \sum_{j=1}^\infty j \cdot a_j\, g(y)^{j-1}$

of $f'$ with $g$ is defined for every $y \in k$ under these conditions. The usual version
of the chain rule implies that

(5.63)    $(f \circ g)'(y) = f'(g(y))\, g'(y)$

for every $y \in k$, where $(f \circ g)'(y)$ is the derivative of $f \circ g$ at $y$. More precisely, the
discussion of the chain rule in Section 4.1 implies that the value of the derivative
of $f \circ g$ at any point $y \in k$ is given as in (5.63). Alternatively, $(f \circ g)(y)$ may be
expressed as a polynomial in $y$, whose derivative is the same as the right side of
(5.63) as a product of polynomials in $y$, as in the previous paragraph.

Now suppose that $|\cdot|$ is a nontrivial ultrametric absolute value function
on $k$, and that $k$ is complete with respect to the ultrametric associated to $|\cdot|$.
Suppose also that $a_0, a_1, a_2, a_3, \ldots$ and $b_0, b_1, b_2, b_3, \ldots$ are sequences of elements
of $k$ that satisfy (3.112), (3.113), and (3.114) in Section 3.7 for some $r, t > 0$.
This implies that $f(x)$ and $g(y)$ can be defined as in (5.59) for every $x, y \in k$
with $|x| \le r$ and $|y| \le t$, and that $|g(y)| \le r$ for all such $y$. This permits one
to define $f(g(y))$ for every $y \in k$ with $|y| \le t$, and the discussion in Section 3.7
shows that $f(g(y))$ is given by a power series in $y$, with suitable convergence
properties. As in Section 4.6, $f$ and $g$ are differentiable as $k$-valued functions on
the closed balls $\overline{B}(0, r)$ and $\overline{B}(0, t)$ in $k$, respectively, with derivatives given as
in (5.61). These series for the derivatives have convergence properties like those
for $f$ and $g$, and in particular the discussion in Section 3.7 can be applied to $f'$
instead of $f$, to get that $f'(g(y))$ can be expressed by a power series in $y$ with
suitable convergence properties as well. As before, the discussion of the chain
rule in Section 4.1 implies that (5.63) holds for every $y \in k$ with $|y| \le t$. Note
that the right side of (5.63) is given by the Cauchy product of the power series
for $f'(g(y))$ and $g'(y)$, which is a power series with the same type of convergence
properties. Similarly, the left side of (5.63) can be obtained by differentiating
the power series expansion for $(f \circ g)(y)$, as in Section 4.6. One can check that
(5.63) holds as an equality between power series in $y$ as well, which is to say
that the power series on both sides of the equation have the same coefficients.
More precisely, this can be verified directly, but we shall not go through the
details here. Otherwise, this can be obtained from the fact that (5.63) holds for
every $y \in k$ with $|y| \le t$, since $t > 0$ and $|\cdot|$ is nontrivial on $k$, and since both
sides of (5.63) are given by power series in $y$ that converge for these $y$.

Suppose now that $k = \mathbf{R}$ or $\mathbf{C}$, with the standard absolute value function,
and that $a_0, a_1, a_2, a_3, \ldots$ and $b_0, b_1, b_2, b_3, \ldots$ are sequences of elements of $k$ that
satisfy (3.100) and (3.101) in Section 3.7 for some $r, t > 0$. As in the previous
situation, this implies that $f(x)$ and $g(y)$ can be defined as in (5.59) for every
$x, y \in k$ with $|x| \le r$ and $|y| \le t$, and that $|g(y)| \le r$ for all such $y$. Thus $f(g(y))$
is also defined for every $y \in k$ with $|y| \le t$, and the discussion in Section 3.7
shows that $f(g(y))$ is given by an absolutely convergent power series. In order
to deal with derivatives, let us ask in addition that

(5.64)    $\sum_{j=1}^\infty j\, |a_j|\, r^{j-1}, \quad \sum_{l=1}^\infty l\, |b_l|\, t^{l-1}$

converge. This implies that $f$ and $g$ are differentiable on the closed balls $\overline{B}(0, r)$
and $\overline{B}(0, t)$ in $k$, respectively, as in Section 4.6, with derivatives given as in
(5.61). As before, the discussion in Section 3.7 can be applied to $f'$ instead of
$f$, to get that $f'(g(y))$ is given by an absolutely convergent power series in $y$ too.
The discussion of the chain rule in Section 4.6 implies that $f \circ g$ is differentiable
on $\overline{B}(0, t)$, and that (5.63) holds for every $y \in k$ with $|y| \le t$. However, one
should be a bit more careful about the convergence of the power series for the
derivative of $f \circ g$ in this case. Let $c_0, c_1, c_2, c_3, \ldots$ be the coefficients of the
power series expansion for $(f \circ g)(y)$, as in Section 3.7. One can check that

(5.65)    $\sum_{n=1}^\infty n\, |c_n|\, t^{n-1}$

converges under these conditions, using the convergence of the series in (5.64).
This can be done directly, along with showing that (5.63) holds as an equality
between power series. Alternatively, the convergence of the power series for
$(f \circ g)(y)$ when $|y| \le t$ implies that the power series for the derivative of $(f \circ g)(y)$
converges when $|y| < t$. It follows that the derivative of $(f \circ g)(y)$ is given by
this power series when $|y| < t$, as in Section 4.6. Both factors on the right
side of (5.63) are already given by power series that converge absolutely when
$|y| \le t$, and so their product has the same property. As before, the power series
on both sides of (5.63) have to have the same coefficients, because (5.63) holds
for all $y \in k$ with $|y| < t$ and $t > 0$. This permits the convergence of (5.65) to
be derived from the absolute convergence of the series on the right side of (5.63)
when $|y| = t$. Of course, the convergence of (5.65) implies that the derivative of
$(f \circ g)(y)$ on $\overline{B}(0, t)$ is given by the corresponding power series for every $y \in k$
with $|y| \le t$, as in Section 4.6.


5.5 Functions of sums 

Let $k$ be a field, and let

(5.66)    $f(x) = \sum_{j=0}^\infty a_j\, x^j$

be a power series with coefficients in $k$. Also let $B$ be a nonempty set, and let
$b$ be a $k$-valued function on $B$. We would like to consider

(5.67)    $f\Big(\sum_{l \in B} b(l)\Big) = \sum_{j=0}^\infty a_j\, \Big(\sum_{l \in B} b(l)\Big)^j,$

at least formally at first. Of course, there is no problem with this when $a_j = 0$
for all but finitely many $j \ge 0$, and $b(l) = 0$ for all but finitely many $l \in B$.
This is analogous to the discussion in Section 3.7, which corresponds to the case
where $B = \mathbf{Z}_+ \cup \{0\}$ and $b(l) = b_l\, y^l$ for some $b_l, y \in k$.

As in Section 3.7, let

(5.68)    $E_j = B^j$

be the $j$th Cartesian power of $B$ for each positive integer $j$, consisting of $j$-tuples
$\alpha = (\alpha_1, \ldots, \alpha_j)$ of elements of $B$. Put

(5.69)    $\beta_j(\alpha) = b(\alpha_1)\, b(\alpha_2) \cdots b(\alpha_j)$

for each $j \in \mathbf{Z}_+$ and $\alpha \in E_j$, so that

(5.70)    $\Big(\sum_{l \in B} b(l)\Big)^j = \sum_{\alpha \in E_j} \beta_j(\alpha),$

at least formally again. As before, there is no problem with this when $b(l) = 0$
for all but finitely many $l \in B$. Note that the sets $E_j$ are considered to be
pairwise disjoint.

As in Section 3.7 again, we let $E_0$ be a set with exactly one element, not
contained in $E_j$ for any $j \ge 1$. Thus the sets $E_j$ are pairwise disjoint for all
$j \ge 0$, and we put

(5.71)    $E = \bigcup_{j=0}^\infty E_j.$

Let $\phi$ be the $k$-valued function on $E$ defined by

(5.72)    $\phi(\alpha) = a_j\, \beta_j(\alpha)$

for each $\alpha \in E_j$ when $j \ge 1$, and $\phi = a_0$ on $E_0$. Combining (5.67) and (5.70),
we get that

(5.73)    $f\Big(\sum_{l \in B} b(l)\Big) = \sum_{j=0}^\infty a_j\, \Big(\sum_{\alpha \in E_j} \beta_j(\alpha)\Big) = \sum_{\alpha \in E} \phi(\alpha),$

at least formally again. As usual, if $a_j = 0$ for all but finitely many $j \ge 0$, and
$b(l) = 0$ for all but finitely many $l \in B$, then $\phi \in c_{00}(E, k)$, and there is no
problem with (5.73).

Suppose for the moment that $k = \mathbf{R}$ or $\mathbf{C}$, with the standard absolute value
function. Suppose also that

(5.74)    $\sum_{j=0}^\infty |a_j|\, r^j$

converges for some nonnegative real number $r$, and that

(5.75)    $\sum_{l \in B} |b(l)| \le r,$

where the sum on the left side of (5.75) can be defined as in Section 1.10. The
convergence of (5.74) implies that the right side of (5.66) converges absolutely
for every $x \in k$ with $|x| \le r$. The finiteness of the sum on the left side of (5.75)
means that $b(l)$ is summable as a $k$-valued function on $B$, as in Section 2.3.
This implies that $\sum_{l \in B} b(l)$ can be defined as in Sections 2.6 and 2.7, and that

(5.76)    $\Big|\sum_{l \in B} b(l)\Big| \le \sum_{l \in B} |b(l)| \le r,$

as in (2.61) in Section 2.7.

It follows that the series in $j$ on the right side of (5.67) converges absolutely
under these conditions, which can be used to define the left side of (5.67). We
also have that

(5.77)    $\sum_{\alpha \in E_j} |\beta_j(\alpha)| = \Big(\sum_{l \in B} |b(l)|\Big)^j \le r^j$

for each $j \ge 1$, so that

(5.78)    $\sum_{\alpha \in E} |\phi(\alpha)| = |a_0| + \sum_{j=1}^\infty \Big(\sum_{\alpha \in E_j} |a_j|\, |\beta_j(\alpha)|\Big) \le \sum_{j=0}^\infty |a_j|\, r^j.$

In particular, (5.77) implies that $\beta_j$ is summable on $E_j$ for each $j \ge 1$, and (5.78)
implies that $\phi$ is summable on $E$. The summability of $\beta_j$ on $E_j$ means that the
right side of (5.70) can be defined as in Sections 2.6 and 2.7, and this sum can
be evaluated as an iterated sum to get (5.70). Similarly, the summability of $\phi$
on $E$ means that the right side of (5.73) can be defined as in Sections 2.6 and
2.7, and the second step in (5.73) can be obtained as in Section 2.9.
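In the simplest situation, where $a_j = 0$ for all but finitely many $j$ and $B$ is finite, both sides of (5.73) are finite sums and can be compared directly. The following is a minimal sketch under those assumptions, with $k = \mathbf{Q}$; the index set labels `"p"`, `"q"`, `"r"` and the coefficients are purely illustrative.

```python
from fractions import Fraction
from itertools import product
from math import prod

# Illustrative data: a_0, ..., a_3 and a function b on a three-element index set B.
a = [Fraction(2), Fraction(-1), Fraction(1, 3), Fraction(5)]
b = {"p": Fraction(1, 2), "q": Fraction(-3), "r": Fraction(1, 5)}

def f(x):
    """The polynomial f(x) = sum_j a_j x^j, as in (5.66)."""
    return sum(a_j * x ** j for j, a_j in enumerate(a))

# Left side of (5.73): f applied to the sum of b over B.
lhs = f(sum(b.values()))

# Right side of (5.73): the sum of phi over E, where phi = a_0 on E_0 and
# phi(alpha) = a_j * b(alpha_1) ... b(alpha_j) for alpha in E_j = B^j, j >= 1.
rhs = a[0]
for j in range(1, len(a)):
    for alpha in product(b, repeat=j):
        rhs += a[j] * prod(b[l] for l in alpha)

assert lhs == rhs
```

Here `itertools.product(b, repeat=j)` enumerates the $j$-tuples in $E_j = B^j$, so the inner loop is the sum on the right side of (5.70) multiplied by $a_j$.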

Now let $k$ be any field with an ultrametric absolute value function $|\cdot|$, where
$k$ is complete with respect to the corresponding ultrametric. Suppose that

(5.79)    $\lim_{j \to \infty} |a_j|\, r^j = 0$

for some nonnegative real number $r$,

(5.80)    $b$ vanishes at infinity on $B$,

and

(5.81)    $\max_{l \in B} |b(l)| \le r.$

Thus the series on the right side of (5.66) converges in $k$ for every $x \in k$ with
$|x| \le r$, because of (5.79) and the completeness of $k$. Similarly, $\sum_{l \in B} b(l)$ can
be defined as an element of $k$ as in Sections 2.6 and 2.7, because of (5.80) and
the completeness of $k$. We also have that

(5.82)    $\Big|\sum_{l \in B} b(l)\Big| \le \max_{l \in B} |b(l)| \le r,$

as in (2.62) in Section 2.7.

As before, the series on the right side of (5.67) converges in $k$ in this situation,
which can be used to define the left side of (5.67). It is easy to see that

(5.83)    $\beta_j$ vanishes at infinity on $E_j$

for each $j \ge 1$, because of (5.80), and that

(5.84)    $\max_{\alpha \in E_j} |\beta_j(\alpha)| = \Big(\max_{l \in B} |b(l)|\Big)^j \le r^j,$

by (5.81). This implies that

(5.85)    $\max_{\alpha \in E_j} |\phi(\alpha)| = |a_j|\, \max_{\alpha \in E_j} |\beta_j(\alpha)| \le |a_j|\, r^j$

for each $j \ge 1$, which tends to 0 as $j \to \infty$, by (5.79). It follows that

(5.86)    $\phi$ vanishes at infinity on $E$,

since (5.83) implies that the restriction of $\phi$ to $E_j$ vanishes at infinity for each
$j \ge 1$. As in Section 2.6, (5.83) implies that

(5.87)    $\sum_{\alpha \in E_j} \beta_j(\alpha)$ satisfies the generalized Cauchy criterion

for each $j \ge 1$, and similarly (5.86) implies that

(5.88)    $\sum_{\alpha \in E} \phi(\alpha)$ satisfies the generalized Cauchy criterion.

Thus these sums can be defined in $k$, because $k$ is complete, as in Section 2.7.
One can also check that (5.70) and (5.73) hold under these conditions, using
the remarks in Section 2.9.


5.6 The logarithm 

Let $k$ be a field of characteristic 0, and consider

(5.89)    $\log(1 + X) = \sum_{j=1}^\infty \frac{(-1)^{j+1}}{j}\, X^j$

as a formal power series with coefficients in $k$, where $X$ is an indeterminate. If
$k$ is equipped with an absolute value function $|\cdot|$, then we may put

(5.90)    $\log(1 + x) = \sum_{j=1}^\infty \frac{(-1)^{j+1}}{j}\, x^j$

for every $x \in k$ such that the series on the right side of (5.90) converges. In
particular, this series converges when $x = 0$, so that

(5.91)    $\log 1 = 0.$

Of course, this is the usual power series of the logarithm, so that (5.90) may be
considered as the definition of a logarithm function on a subset of $k$. Note that
the formal derivative of (5.89) is given by

(5.92)    $\sum_{j=1}^\infty (-1)^{j+1}\, X^{j-1} = \sum_{j=0}^\infty (-1)^j\, X^j,$

which is the power series associated to

(5.93)    $\frac{1}{1 + X}.$

Suppose for the moment that $k = \mathbf{R}$ or $\mathbf{C}$, with the standard absolute value
function. In this case, it is well known that the radius of convergence of (5.89) is
equal to 1. More precisely, if $x \in k$ and $|x| < 1$, then the series on the right side
of (5.90) converges absolutely, by comparison with the convergent series

(5.94)    $\sum_{j=1}^\infty |x|^j$

of nonnegative real numbers. However, the series on the right side of (5.90) does
not converge absolutely when $|x| = 1$, which is the same in this case as saying
that it does not converge when $x = -1$. If $x = 1$, then the right side of (5.90)
does converge, by Leibniz' alternating series test. Similarly, if $x \in \mathbf{C}$, $|x| = 1$,
and $x \ne -1$, then one can show that the right side of (5.90) converges in $\mathbf{C}$, as
in Theorem 3.44 on p71 of [22]. Of course, the logarithm can be defined in other
ways for all positive real numbers, and it can be extended holomorphically to
suitable domains in the complex plane. These extensions satisfy

(5.95)    $\frac{d}{dz} \log z = \frac{1}{z},$

and indeed this can be used to define the logarithm, together with $\log 1 = 0$.
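The behavior of the series in (5.90) for real $x$ can be observed through its partial sums. This is only a numerical sketch in floating-point arithmetic; the function name `log1p_partial` and the numbers of terms are illustrative, and the standard library functions `math.log1p` and `math.log` are used as reference values.

```python
import math

def log1p_partial(x, terms):
    """Partial sum of the series in (5.90), using the first `terms` terms."""
    return sum((-1) ** (j + 1) * x ** j / j for j in range(1, terms + 1))

# For |x| < 1 the convergence is geometric, by comparison with (5.94).
assert abs(log1p_partial(0.5, 60) - math.log1p(0.5)) < 1e-12

# At x = 1 the series is alternating with decreasing terms, so the error of the
# n-th partial sum is at most the size of the next term, 1/(n + 1).
n = 1000
assert abs(log1p_partial(1.0, n) - math.log(2.0)) <= 1.0 / (n + 1)

# At x = -1 the partial sums are the negatives of the harmonic numbers, which are unbounded.
assert log1p_partial(-1.0, 1000) < log1p_partial(-1.0, 100)
```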

Now let $k$ be a field of characteristic 0 with an ultrametric absolute value
function $|\cdot|$, and suppose that $k$ is complete with respect to the ultrametric
associated to $|\cdot|$. Thus the series on the right side of (5.90) converges in $k$ exactly
when the terms of the series converge to 0 in $k$. Remember that $|\cdot|$ induces an
ultrametric absolute value function on $\mathbf{Q}$, using the natural embedding of $\mathbf{Q}$ in
$k$. If the induced absolute value function on $\mathbf{Q}$ is trivial, then

(5.96)    $|(-1)^{j+1}\, x^j / j| = |x|^j$

for every $x \in k$ and $j \in \mathbf{Z}_+$. In this case, the series on the right side of (5.90)
converges exactly when $|x| < 1$. Suppose for the moment that $|\cdot|$ is not the
trivial absolute value function on $k$, so that every point in the open unit ball
$B(0, 1)$ in $k$ is a limit point of $B(0, 1)$. As in Section 4.6, the derivative of
$\log(1 + x)$ as a $k$-valued function defined on $B(0, 1)$ exists at every point in
$B(0, 1)$, and is given by the corresponding power series for the derivative. Thus
the derivative of $\log(1 + x)$ is equal to

(5.97)    $\frac{1}{1 + x}$

for every $x \in k$ with $|x| < 1$, since the power series for the derivative of $\log(1 + x)$
corresponds to the usual power series for (5.97), as before. Of course, if $|\cdot|$ is
the trivial absolute value function on $k$, then $\log(1 + x)$ is defined only for $x = 0$,
and so the derivative is not defined.

If the induced absolute value function on $\mathbf{Q}$ is not trivial, then there is a
prime number $p$ such that the induced absolute value function on $\mathbf{Q}$ is equivalent
to the $p$-adic absolute value function, by Ostrowski's theorem, as in Section 1.8.
In this case, we may as well ask that the induced absolute value function on $\mathbf{Q}$
be equal to the $p$-adic absolute value, since this can be arranged by replacing
the given absolute value function $|\cdot|$ on $k$ by a suitable positive power of itself.
This implies that

(5.98)    $|(-1)^{j+1}\, x^j / j| = |x|^j / |j|_p$

for every $x \in k$ and $j \in \mathbf{Z}_+$, where $|j|_p$ is the $p$-adic absolute value of $j$. It is
easy to see that

(5.99)    $1/j \le |j|_p \le 1$

for every $j \in \mathbf{Z}_+$, so that

(5.100)    $|x|^j \le |(-1)^{j+1}\, x^j / j| \le j\, |x|^j$

for every $x \in k$ and $j \in \mathbf{Z}_+$. It follows that (5.98) tends to 0 as $j \to \infty$ exactly
when $|x| < 1$ in this situation. Equivalently, this means that the right side of
(5.90) converges in $k$ exactly when $|x| < 1$ under these conditions. As in the
preceding paragraph, the derivative of $\log(1 + x)$ as a $k$-valued function on the
open unit ball $B(0, 1)$ in $k$ exists at every point in $B(0, 1)$, and is equal to (5.97),
by the discussion in Section 4.6.
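The estimates (5.99) and (5.100) can be tested for rational $x$ with the $p$-adic absolute value on $\mathbf{Q}$. The sketch below assumes $p = 3$ and $x = 3/4$, which are arbitrary choices, and the helper names `vp` and `abs_p` are illustrative.

```python
from fractions import Fraction

def vp(q, p):
    """p-adic valuation of a nonzero rational number q."""
    q = Fraction(q)
    if q == 0:
        raise ValueError("the valuation of 0 is not a finite integer")
    num, den, v = q.numerator, q.denominator, 0
    while num % p == 0:
        num //= p
        v += 1
    while den % p == 0:
        den //= p
        v -= 1
    return v

def abs_p(q, p):
    """p-adic absolute value |q|_p = p^(-vp(q)), with |0|_p = 0."""
    q = Fraction(q)
    return Fraction(0) if q == 0 else Fraction(1, p) ** vp(q, p)

p = 3
x = Fraction(3, 4)  # |x|_3 = 1/3 < 1

for j in range(1, 201):
    # (5.99)
    assert Fraction(1, j) <= abs_p(j, p) <= 1
    term = Fraction((-1) ** (j + 1), j) * x ** j
    # (5.98)
    assert abs_p(term, p) == abs_p(x, p) ** j / abs_p(j, p)
    # (5.100)
    assert abs_p(x, p) ** j <= abs_p(term, p) <= j * abs_p(x, p) ** j
```

Since $j\, |x|_p^j \to 0$ as $j \to \infty$ when $|x|_p < 1$, the upper bound in (5.100) shows that the terms of the series tend to 0.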


5.7 The usual identity 

If $u$, $v$ are positive real numbers, then

(5.101)    $\log(u v) = \log u + \log v,$

where the logarithm refers to the standard real-valued logarithm function on
the positive half-line. There are analogous statements for complex numbers,
but one should be careful about the conditions under which they hold.

Now let $k$ be a field with characteristic 0, and let $Y$ and $Z$ be commuting
indeterminates. Observe that

(5.102)    $(1 + Y)(1 + Z) = 1 + Y + Z + Y Z,$

so that

(5.103)    $\log((1 + Y)(1 + Z)) = \log(1 + Y + Z + Y Z) = \sum_{j=1}^\infty \frac{(-1)^{j+1}}{j}\, (Y + Z + Y Z)^j,$

at least formally. More precisely, the right side of (5.103) can be defined as a
formal power series in $Y$ and $Z$, by expanding

(5.104)    $(Y + Z + Y Z)^j$

into a finite sum of monomials for each $j$. Each of the monomials that occurs in
the sum has total degree in $Y$ and $Z$ greater than or equal to $j$, and less than
or equal to $2j$. Any particular monomial in $Y$ and $Z$ can occur in (5.104) for
only finitely many $j$, and hence the coefficient of such a monomial in the right
side of (5.103) is given by a finite sum in $k$.

In analogy with (5.101), we have that 

(5.105) log((l + r) (1 + Z)) = log(l + Y)+ log(l + Z), 

as an equality between formal power series in Y and Z. One way to look at 
this is to start with the case where k = R or C, where the identity for the 
corresponding functions near 1 implies the appropriate identities between the 
power series coefficients. These identities between the power series coefficients 
are simply statements about finite sums of rational numbers, which carry over 
to any field of characteristic 0. Alternatively, one can check that the formal 
derivatives of both sides of (5.105) in Y and Z are the same, as formal power 
series in Y and Z. This implies (5.105), because the constant terms on both 
sides of the equation are equal to 0, and because k has characteristic zero, so 
that the non-constant terms can be recovered from their first derivatives. 
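The identity (5.105) between formal power series can be checked coefficient by coefficient up to a given total degree. This is a sketch, assuming $k = \mathbf{Q}$ and a truncation degree `N` chosen arbitrarily; a truncated series in $Y$ and $Z$ is stored as a dictionary from exponent pairs to coefficients, and the helper names are illustrative.

```python
from collections import defaultdict
from fractions import Fraction

N = 6  # keep monomials of total degree at most N (an arbitrary choice)

def mul(p, q):
    """Product of two truncated series in Y and Z, discarding total degree > N."""
    out = defaultdict(Fraction)
    for (a, b), u in p.items():
        for (c, d), v in q.items():
            if a + b + c + d <= N:
                out[(a + c, b + d)] += u * v
    return dict(out)

def log1p_series(w):
    """Truncation of sum_j (-1)^(j+1)/j * w^j, for w with zero constant term."""
    total = defaultdict(Fraction)
    power = {(0, 0): Fraction(1)}
    for j in range(1, N + 1):  # w^j has no terms of total degree below j
        power = mul(power, w)
        for mono, coeff in power.items():
            total[mono] += Fraction((-1) ** (j + 1), j) * coeff
    return {m: c for m, c in total.items() if c != 0}

Y = {(1, 0): Fraction(1)}
Z = {(0, 1): Fraction(1)}
W = {(1, 0): Fraction(1), (0, 1): Fraction(1), (1, 1): Fraction(1)}  # Y + Z + YZ

lhs = log1p_series(W)  # left side of (5.105), as in (5.103)
rhs = defaultdict(Fraction)
for series in (log1p_series(Y), log1p_series(Z)):
    for mono, coeff in series.items():
        rhs[mono] += coeff
rhs = {m: c for m, c in rhs.items() if c != 0}

assert lhs == rhs
```

In particular, the coefficients of all mixed monomials $Y^m Z^n$ with $m, n \ge 1$ and $m + n \le N$ on the left side cancel, which is a statement about finite sums of rational numbers, as above.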

If $|\cdot|$ is an absolute value function on $k$, then

(5.106)    $\{w \in k : |w| = 1\}$

is a group with respect to multiplication. Suppose now that $|\cdot|$ is an ultrametric
absolute value function on $k$, and let us check that

(5.107)    $\{w \in k : |w - 1| < 1\}$

is a subgroup of (5.106). If $w \in k$ and $|w - 1| < 1$, then it is easy to see that
$|w| = 1$, as in (1.43) in Section 1.3. We also have that

(5.108)    $1/w - 1 = (1 - w)/w,$

so that

(5.109)    $|1/w - 1| = |1 - w| / |w| = |1 - w| < 1,$

which means that $1/w$ is an element of (5.107) too. Note that

(5.110)    $(1 + y)(1 + z) = 1 + y + z + y z$

for every $y, z \in k$, as in (5.102), so that

(5.111)    $|(1 + y)(1 + z) - 1| = |y + z + y z| \le \max(|y|, |z|, |y|\, |z|).$

If $|y|, |z| < 1$, then it follows that

(5.112)    $|(1 + y)(1 + z) - 1| < 1.$

This implies that (5.107) is closed under multiplication, as desired.
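These group properties can be tested in the case of the $p$-adic absolute value on $\mathbf{Q}$. The sketch below assumes $p = 5$ and a handful of sample rationals with $|y|_5 < 1$; the helper `abs_p` and the samples are illustrative choices.

```python
from fractions import Fraction

P = 5  # an illustrative prime

def abs_p(q, p=P):
    """p-adic absolute value on Q, with |0|_p = 0."""
    q = Fraction(q)
    if q == 0:
        return Fraction(0)
    num, den, v = q.numerator, q.denominator, 0
    while num % p == 0:
        num //= p
        v += 1
    while den % p == 0:
        den //= p
        v -= 1
    return Fraction(1, p) ** v

# Sample elements y with |y|_5 < 1, so that 1 + y lies in (5.107).
samples = [Fraction(5, 3), Fraction(-10, 7), Fraction(25, 2), Fraction(0), Fraction(15, 4)]

for y in samples:
    assert abs_p(1 + y) == 1                       # (5.107) is contained in (5.106)
    assert abs_p(1 / (1 + y) - 1) == abs_p(y)      # (5.109), with w = 1 + y
    for z in samples:
        w = (1 + y) * (1 + z)
        # (5.111)
        assert abs_p(w - 1) <= max(abs_p(y), abs_p(z), abs_p(y) * abs_p(z))
        # (5.112)
        assert abs_p(w - 1) < 1
```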

Let us continue to suppose that $|\cdot|$ is an ultrametric absolute value function
on $k$, and let us also ask that $k$ be complete with respect to the ultrametric
associated to $|\cdot|$. If $x \in k$ and $|x| < 1$, then $\log(1 + x)$ can be defined in $k$ by
(5.90), as in the previous section. Similarly, if $y, z \in k$ satisfy $|y|, |z| < 1$, then

(5.113)    $\log((1 + y)(1 + z)) = \log(1 + y + z + y z) = \sum_{j=1}^\infty \frac{(-1)^{j+1}}{j}\, (y + z + y z)^j$

can be defined in $k$ as before, because of (5.112). Under these conditions, one
can show that

(5.114)    $\log((1 + y)(1 + z)) = \log(1 + y) + \log(1 + z).$

Let us mention two ways to look at this, following [2, 12].

In the first approach, one can begin by expanding

(5.115)    $(y + z + y z)^j$

into a sum of monomials in $y$ and $z$, as we did before for (5.104). If one plugs
the resulting sums into the right side of (5.113), then one would like to rearrange
the terms to get a sum that corresponds to the formal power series expansion
in $y$ and $z$. More precisely, this can be done using the discussions in Sections
2.9 and 5.5. This permits (5.114) to be derived from the analogous statement
(5.105) for formal power series, as on p281 of [2].

Alternatively, let $y \in k$ with $|y| < 1$ be given, and consider

(5.116)    $\log((1 + y)(1 + z)) = \log(1 + y + (1 + y)\, z) = \sum_{j=1}^\infty \frac{(-1)^{j+1}}{j}\, (y + (1 + y)\, z)^j$

as a function of $z$. This can be converted into a power series in $z$ that converges
for $|z| < 1$, as in Section 3.9. Thus (5.114) may be treated as an equation
relating power series in $z$, with $y$ as a constant. Note that (5.114) holds trivially
for every $y, z \in k$ with $|y|, |z| < 1$ when $|\cdot|$ is the trivial absolute value function
on $k$, since $y = z = 0$. Thus we may as well suppose that $|\cdot|$ is not the trivial
absolute value function on $k$, so that every element of the open unit ball $B(0, 1)$
in $k$ is a limit point of $B(0, 1)$. In this case, the derivative of $\log(1 + z)$ as a
$k$-valued function on $B(0, 1)$ is equal to

(5.117)    $\frac{1}{1 + z}$

for every $z \in B(0, 1)$, as in the previous section. This implies that the derivative
of the right side of (5.114) as a $k$-valued function of $z$ on $B(0, 1)$ is equal to
(5.117) too. Similarly, one can check that the derivative of the left side of
(5.114) as a $k$-valued function of $z$ on $B(0, 1)$ is equal to (5.117), using the chain
rule. Both sides of (5.114) can be expressed as power series in $z$, as before, and
so their derivatives are given by the corresponding differentiated power series in
$z$, as in Section 4.6. It follows that the coefficients of the differentiated power
series in $z$ corresponding to both sides of (5.114) are the same, since the values
of the derivatives are the same on $B(0, 1)$, and $|\cdot|$ is not the trivial absolute
value function on $k$. Thus the coefficients of the power series in $z$ corresponding
to both sides of (5.114) are the same, except perhaps for the constant terms,
because $k$ has characteristic 0. Of course, (5.114) obviously holds when $z = 0$,
which means that the constant terms of the power series in $z$ corresponding to
both sides of (5.114) are the same as well. This implies that (5.114) holds for
every $z \in k$ with $|z| < 1$, as in the proof of Proposition 4.5.3 on p110 of [12].


5.8 Some additional properties 

Let $k$ be a field, and let $|\cdot|$ be an absolute value function on $k$. If $u, v \in k$ and
$|u| = |v| = 1$, then

(5.118)    $|(u/v) - 1| = |u - v| / |v| = |u - v|.$

Suppose now that $|\cdot|$ is an ultrametric absolute value function on $k$, and that
$y, z \in k$ satisfy $|y|, |z| < 1$. Remember that $u = 1 + y$ and $v = 1 + z$ satisfy
$|u| = |v| = 1$, as in the previous section. In this case, (5.118) implies that

(5.119)    $|(1 + y)/(1 + z) - 1| = |y - z|.$


Let us suppose for the rest of the section that $k$ is a field of characteristic
0 with an ultrametric absolute value function $|\cdot|$, and that $k$ is complete with
respect to the ultrametric associated to $|\cdot|$. Let us also suppose that the induced
absolute value function on $\mathbf{Q}$ is trivial. If $x \in k$ and $|x| < 1$, then

(5.120)    $|\log(1 + x)| = |x|.$

Of course, this is trivial when $x = 0$, and so it suffices to verify that (5.120)
holds when $x \ne 0$. Observe that

(5.121)    $|\log(1 + x) - x| = \Big|\sum_{j=2}^\infty \frac{(-1)^{j+1}}{j}\, x^j\Big| \le \max_{j \ge 2} |x^j / j|,$

using the definition (5.90) of $\log(1 + x)$ in the first step, and the ultrametric
version of the triangle inequality in the second step. It follows that

(5.122)    $|\log(1 + x) - x| \le \max_{j \ge 2} |x|^j = |x|^2$

when the induced absolute value function on $\mathbf{Q}$ is trivial, and hence that

(5.123)    $|\log(1 + x) - x| < |x|$

when $x \ne 0$ and $|x| < 1$. This implies (5.120), using the ultrametric version of
the triangle inequality again, as in (1.43) in Section 1.3.

Let us check that 

(5.124)    $|\log(1 + y) - \log(1 + z)| = |y - z|$

for every $y, z \in k$ with $|y|, |z| < 1$ under these conditions. Note that

(5.125)    $|(1 + y)/(1 + z) - 1| < 1,$

by (5.119), or the fact that (5.107) is a group with respect to multiplication.
Thus the logarithm of $(1 + y)/(1 + z)$ is defined, and

(5.126)    $\log((1 + y)/(1 + z)) = \log(1 + y) - \log(1 + z),$

by (5.114). It follows that

(5.127)    $|\log(1 + y) - \log(1 + z)| = |\log((1 + y)/(1 + z))|$
           $= |(1 + y)/(1 + z) - 1| = |y - z|,$

as desired, using (5.120) in the second step, and (5.119) in the third step. One 
could also get (5.124) as in (4.128) or (4.134) in Section 4.10. 

It will be convenient to put 

(5.128)    $f(x) = \log(1 + x)$

for every $x \in k$ with $|x| < 1$, so that $f$ defines a $k$-valued function on the open
unit ball $B(0, 1)$ in $k$. Of course, $f$ maps $B(0, 1)$ into itself, by (5.120), and in
fact one can check that

(5.129)    $f(B(0, 1)) = B(0, 1),$

using Hensel’s lemma. More precisely, one can take $x_0 = 0$ in Section 4.10, and
$0 < r = t < 1$. Thus $f$ maps $B(0, t)$ onto itself for every $t \in (0, 1)$, as in (4.136),
since $f(0) = 0$ and $f'(0) = 1$. This implies (5.129), which also corresponds to
(4.158) in Section 4.11, with $r_1 = 1$.


5.9 Some additional properties, continued 

Let $k$ be a field with an ultrametric absolute value function $|\cdot|$. It is easy to see
that

(5.130)    $\{w \in k : |w - 1| < r\}$

is a group with respect to multiplication when $0 < r \le 1$, and similarly that

(5.131)    $\{w \in k : |w - 1| \le r\}$





is a group with respect to multiplication when 0 < r < 1. More precisely, 
(5.130) and (5.131) are subgroups of the group 

(5.132)    $\{w \in k : |w| = 1\}$

with respect to multiplication. If r = 1, then (5.130) is the same as (5.107) in 
Section 5.7, which we have already seen is a subgroup of (5.132). Essentially 
the same argument works in the other cases, using (5.109) and (5.111). 

Let us suppose for the rest of this section that $k$ is a field of characteristic
0 with an ultrametric absolute value function $|\cdot|$, and that $k$ is complete with
respect to the associated ultrametric. If the induced absolute value function on
$\mathbf{Q}$ is not trivial, then it is equivalent to the $p$-adic absolute value function on
$\mathbf{Q}$ for some prime number $p$, by Ostrowski’s theorem, as in Section 1.8. Let
us suppose also that the induced absolute value function on $\mathbf{Q}$ is equal to the
$p$-adic absolute value function, which can always be arranged by replacing $|\cdot|$
on $k$ by a suitable positive power of itself.

Observe that 

(5.133)    $p^l - 1 = (p - 1) \sum_{m=0}^{l-1} p^m$

for every positive integer $l$, which also holds when $l = 0$, with the sum interpreted
as being equal to 0. If $r$ is a nonnegative real number such that

(5.134)    $r \le p^{-1/(p-1)},$

then it follows that

(5.135)    $p^l \, r^{p^l - 1} \le 1$

for each nonnegative integer $l$, because the sum in (5.133) has $l$ terms, each of
which is at least 1. Remember that the $p$-adic absolute value $|j|_p$ of
$j \in \mathbf{Z}_+$ is given by

(5.136)    $|j|_p = p^{-l(j)},$

where $l(j)$ is the largest nonnegative integer such that $j$ is an integer multiple
of $p^{l(j)}$. If $r > 0$ satisfies (5.134), then it follows that

(5.137)    $r^{j-1}/|j|_p = p^{l(j)} \, r^{j-1} \le p^{l(j)} \, r^{p^{l(j)} - 1} \le 1$

for every $j \ge 1$. More precisely, this uses the facts that $j \ge p^{l(j)}$ and $r \le 1$ in
the first inequality, and (5.135) in the second inequality.
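
The bound (5.137) can be checked exactly at the largest admissible value $r = p^{-1/(p-1)}$. Taking logarithms in base $p$, it becomes the integer inequality $(p - 1) \, l(j) \le j - 1$. The following Python sketch tests this reformulation over a finite range; the function names are hypothetical, and a finite check is of course only an illustration, not a proof.

```python
def ell(j, p):
    """Largest nonnegative integer l such that p**l divides j, for j >= 1."""
    count = 0
    while j % p == 0:
        j //= p
        count += 1
    return count

def bound_holds(j, p):
    """(5.137) at r = p**(-1/(p-1)), rewritten as (p - 1) * l(j) <= j - 1."""
    return (p - 1) * ell(j, p) <= j - 1

def equality_cases(p, limit):
    """Those j in [1, limit] for which the bound is attained exactly."""
    return [j for j in range(1, limit + 1) if (p - 1) * ell(j, p) == j - 1]

print(all(bound_holds(j, p) for p in (2, 3, 5, 7) for j in range(1, 5000)))
print(equality_cases(2, 100), equality_cases(3, 100))
```

Working with exponents instead of floating-point powers avoids rounding problems in the cases where the bound is attained.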

If $x \in k$ and $|x| < 1$, then $\log(1 + x)$ is defined in $k$ as in Section 5.6, and

(5.138)    $|\log(1 + x)| \le \max_{j \ge 1} |x^j/j| = \max_{j \ge 1} (|x|^j/|j|_p) = \max_{j \ge 1} (p^{l(j)} \, |x|^j),$

using the ultrametric version of the triangle inequality in the first step. If

(5.139)    $|x| \le p^{-1/(p-1)},$

then we have that

(5.140)    $|x|^{j-1}/|j|_p \le 1$





for every $j \ge 1$, by (5.137). Combining this with (5.138), we get that

(5.141)    $|\log(1 + x)| \le |x|$

for every $x \in k$ that satisfies (5.139).

Suppose for the moment that $y, z \in k$ satisfy $|y|, |z| < 1$ and

(5.142)    $|(1 + y)/(1 + z) - 1| \le p^{-1/(p-1)}.$

In particular, this holds when

(5.143)    $|y|, |z| \le p^{-1/(p-1)},$

because (5.131) is a group with respect to multiplication when

(5.144)    $r = p^{-1/(p-1)} < 1.$

Under these conditions, we get that

(5.145)    $|\log(1 + y) - \log(1 + z)| = |\log((1 + y)/(1 + z))|$
           $\le |(1 + y)/(1 + z) - 1| = |y - z|,$

using (5.114) in Section 5.7 in the first step, (5.141) in the second step, and
(5.119) in the previous section in the third step.

Note that 

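(5.146)    $r \max_{j \ge 2} (r^{j-2}/|j|_p) = \max_{j \ge 2} (r^{j-1}/|j|_p) \le 1$

when $r > 0$ satisfies (5.134), using (5.137) in the second step. This implies that

(5.147)    $r \max_{j \ge 2} (r^{j-2}/|j|_p) = \max_{j \ge 2} (r^{j-1}/|j|_p) < 1$

when

(5.148)    $0 < r < p^{-1/(p-1)},$

because the maximum on the left side of (5.146) does not increase when $r$
decreases. Let us check that

(5.149)    $|\log(1 + x)| = |x|$

for every $x \in k$ with

(5.150)    $|x| < p^{-1/(p-1)}.$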

As in (5.121) in the previous section, we have that 


(5.151)    $|\log(1 + x) - x| = \Bigl|\sum_{j=2}^\infty (-1)^{j+1} x^j/j\Bigr| \le \max_{j \ge 2} |x^j/j| = \max_{j \ge 2} (|x|^j/|j|_p),$

for every $x \in k$ with $|x| < 1$, using the current hypothesis about the absolute
value function on $k$ in the last step. We also have that

(5.152)    $\max_{j \ge 2} (|x|^{j-1}/|j|_p) < 1$





when $x \in k$ satisfies (5.150) and $x \ne 0$, by (5.147) with $r = |x|$. Combining this
with (5.151), we get that

(5.153)    $|\log(1 + x) - x| < |x|$

when $x \in k$ satisfies (5.150) and $x \ne 0$. As before, (5.149) follows from (5.153)
when $x \in k$ satisfies (5.150) and $x \ne 0$, as in (1.43) in Section 1.3, and of course
(5.149) is trivial when $x = 0$.
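
To illustrate (5.149) numerically, the following Python sketch computes partial sums of the series for $\log(1 + x)$ exactly and reports their $p$-adic valuations. It assumes rational $x$ with the $p$-adic absolute value, and the helper names are hypothetical. The ultrametric argument above applies to each partial sum with at least one term, so under (5.150) every partial sum should have the same valuation as $x$. The last line looks at $p = 2$, $x = 2$, which satisfies (5.139) but not the strict inequality (5.150), and where the valuation changes.

```python
from fractions import Fraction

def padic_valuation(q, p):
    """Integer v with |q|_p = p**(-v), for a nonzero rational q."""
    q = Fraction(q)
    if q == 0:
        raise ValueError("the valuation of 0 is +infinity")
    v, num, den = 0, q.numerator, q.denominator
    while num % p == 0:
        num //= p
        v += 1
    while den % p == 0:
        den //= p
        v -= 1
    return v

def log_partial_sum(x, n):
    """Exact value of sum_{j=1}^{n} (-1)**(j+1) * x**j / j."""
    x = Fraction(x)
    return sum(((-1) ** (j + 1)) * x ** j / j for j in range(1, n + 1))

# |3|_3 = 1/3 < 3**(-1/2) and |4|_2 = 1/4 < 2**(-1), so (5.150) holds.
for p, x in [(3, 3), (2, 4)]:
    vals = {padic_valuation(log_partial_sum(x, n), p) for n in range(1, 25)}
    print(p, x, padic_valuation(x, p), vals)

# Boundary case |2|_2 = 1/2 = 2**(-1): only (5.141) is available.
print({padic_valuation(log_partial_sum(2, n), 2) for n in range(4, 25)})
```

The boundary case starts at $n = 4$ because the second partial sum $2 - 2$ vanishes there.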

Suppose for the moment that $y, z \in k$ satisfy $|y|, |z| < 1$ and

(5.154)    $|(1 + y)/(1 + z) - 1| < p^{-1/(p-1)},$

which holds in particular when

(5.155)    $|y|, |z| < p^{-1/(p-1)},$

because (5.130) is a group when $r$ is as in (5.144). Under these conditions, we
get that

(5.156)    $|\log(1 + y) - \log(1 + z)| = |\log((1 + y)/(1 + z))|$
           $= |(1 + y)/(1 + z) - 1| = |y - z|,$

using (5.114) in Section 5.7 in the first step, (5.149) in the second step, and
(5.119) in the third step.

Put

(5.157)    $f(x) = \log(1 + x)$

for every $x \in k$ with $|x| < 1$ again, as in the previous section. Using Hensel’s
lemma, one can check that

(5.158)    $f(B(0, p^{-1/(p-1)})) = B(0, p^{-1/(p-1)}).$

More precisely, one can show that $f$ maps $B(0, t)$ onto itself when

(5.159)    $0 < t < p^{-1/(p-1)},$

as in (4.136) in Section 4.10, with $x_0 = 0$. This also corresponds to (4.158) in
Section 4.11, with $r_1 = p^{-1/(p-1)}$.

5.10 Outer measures 

Let $X$ be a set, and let $\mu$ be a function defined on the collection of all subsets of
$X$ with values in the set of nonnegative extended real numbers. If $\mu$ satisfies the
following three conditions, then $\mu$ is said to be an outer measure on $X$. First,

(5.160)    $\mu(\emptyset) = 0.$

Second, if $A \subseteq B \subseteq X$, then

(5.161)    $\mu(A) \le \mu(B).$





Third, if $A_1, A_2, A_3, \ldots$ is any infinite sequence of subsets of $X$, then

(5.162)    $\mu\Bigl(\bigcup_{j=1}^\infty A_j\Bigr) \le \sum_{j=1}^\infty \mu(A_j),$

where the sum on the right side is interpreted as in Section 1.10. In particular,
the sum on the right side of (5.162) is automatically interpreted as being $+\infty$
when $\mu(A_j) = +\infty$ for any $j$. Otherwise, if $\mu(A_j) < \infty$ for each $j$, then the
sum on the right side of (5.162) may be considered as an ordinary infinite series
of nonnegative real numbers, which is equal to $+\infty$ when the series does not
converge in the usual sense in $\mathbf{R}$. Of course, (5.162) holds trivially when the
sum on the right side is equal to $+\infty$.

The second condition (5.161) may be described as monotonicity of $\mu$, and the
third condition (5.162) is known as countable subadditivity. If $A_1, A_2, \ldots, A_n$
is any finite sequence of subsets of $X$, then (5.160) and (5.162) imply that

(5.163)    $\mu\Bigl(\bigcup_{j=1}^n A_j\Bigr) \le \sum_{j=1}^n \mu(A_j),$

by taking $A_j = \emptyset$ when $j > n$. This may be described as finite subadditivity of
$\mu$. As before, the sum on the right side of (5.163) is interpreted as being equal
to $+\infty$ when $\mu(A_j) = +\infty$ for any $j = 1, 2, \ldots, n$, in which case (5.163) holds
trivially.

If $A$ is a subset of $X$, and $A_1, A_2, A_3, \ldots$ is an infinite sequence of subsets of
$X$ such that

(5.164)    $A \subseteq \bigcup_{j=1}^\infty A_j,$

then (5.161) and (5.162) imply that

(5.165)    $\mu(A) \le \sum_{j=1}^\infty \mu(A_j),$

with $B = \bigcup_{j=1}^\infty A_j$ in (5.161). Of course, this property implies (5.162), by
taking $A = \bigcup_{j=1}^\infty A_j$. One can also get (5.161) from (5.165) and (5.160), by
taking $A_1 = B$ and $A_j = \emptyset$ when $j > 1$. Thus the definition of an outer measure can
be equivalently formulated in terms of (5.160) and (5.165), instead of (5.160),
(5.161), and (5.162).

Let $I$ be a countably infinite set, and suppose that for each $j \in I$, $A_j$ is a
subset of $X$. The countable subadditivity property (5.162) can be reformulated
as saying that

(5.166)    $\mu\Bigl(\bigcup_{j \in I} A_j\Bigr) \le \sum_{j \in I} \mu(A_j)$

under these conditions, where the sum on the right side of (5.166) is defined as
in Section 1.10. Similarly, (5.165) can be reformulated as saying that

(5.167)    $\mu(A) \le \sum_{j \in I} \mu(A_j)$





when $A \subseteq \bigcup_{j \in I} A_j$. In both cases, one may as well allow $I$ to be a nonempty
set with only finitely or countably many elements, using (5.160), as before.
One could even allow $I$ to be the empty set, and interpret any sum over $I$ as
being equal to 0, and any union over $I$ as being the empty set. With these
interpretations, (5.166) or (5.167) may be considered to imply that $\mu(\emptyset) \le 0$,
and hence (5.160), since $\mu$ is supposed to be nonnegative by hypothesis. Thus
the notion of an outer measure may be defined in terms of (5.167), where $I$ is
allowed to be any set with only finitely or countably many elements, including
the empty set. Strictly speaking, it is better to consider finite or countable
collections of subsets of $X$, so that an auxiliary set $I$ of indices is not needed.

A subset $B$ of $X$ is said to be measurable with respect to an outer measure
$\mu$ on $X$ if

(5.168)    $\mu(A) = \mu(A \cap B) + \mu(A \setminus B)$

for every subset $A$ of $X$. It is well known that the collection of subsets of $X$ that
are measurable with respect to $\mu$ forms a $\sigma$-algebra, and that the restriction of
$\mu$ to this $\sigma$-algebra of measurable sets is countably additive. Note that $B \subseteq X$
is measurable when $\mu(B) = 0$. An outer measure $\mu$ on a topological space $X$ is
said to be a Borel outer measure if every Borel subset of $X$ is measurable with
respect to $\mu$.
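
The measurability condition (5.168) can be tested by brute force on a finite set. The Python sketch below uses a hypothetical outer measure on a three-point set, chosen only for illustration: it vanishes on the empty set, equals 2 on the whole set, and equals 1 on every other subset. On a finite set, countable subadditivity reduces to subadditivity for pairs of subsets, which is what the code checks.

```python
from itertools import combinations

X = frozenset({0, 1, 2})

def subsets(s):
    """All subsets of a finite set, as frozensets."""
    items = sorted(s)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def mu(a):
    """Hypothetical outer measure: 0 on the empty set, 2 on X, 1 otherwise."""
    if not a:
        return 0
    return 2 if a == X else 1

def is_outer_measure():
    """Check (5.160), monotonicity (5.161), and subadditivity for pairs."""
    subs = subsets(X)
    if mu(frozenset()) != 0:
        return False
    for a in subs:
        for b in subs:
            if a <= b and mu(a) > mu(b):
                return False
            if mu(a | b) > mu(a) + mu(b):
                return False
    return True

def is_measurable(b):
    """Condition (5.168), tested against every subset A of X."""
    return all(mu(a) == mu(a & b) + mu(a - b) for a in subsets(X))

measurable = [b for b in subsets(X) if is_measurable(b)]
print(is_outer_measure(), [sorted(b) for b in measurable])
```

In this example the $\sigma$-algebra of measurable sets turns out to be as small as possible, which shows that an outer measure need not have many measurable sets.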

Suppose for the moment that $(X, d(x, y))$ is a metric space. If $A$ and $B$ are
nonempty subsets of $X$, then the distance between $A$ and $B$ is defined by

(5.169)    $\operatorname{dist}(A, B) = \inf\{d(x, y) : x \in A, y \in B\}.$

Thus $\operatorname{dist}(A, B) > 0$ if and only if there is an $\eta > 0$ such that

(5.170)    $d(x, y) \ge \eta$

for every $x \in A$ and $y \in B$, which implies in particular that $A$ and $B$ are disjoint.
An outer measure $\mu$ on $X$ is said to be a metric outer measure if

(5.171)    $\mu(A \cup B) = \mu(A) + \mu(B)$

for every pair of nonempty subsets $A$, $B$ of $X$ with $\operatorname{dist}(A, B) > 0$. Of course,
it suffices to show that

(5.172)    $\mu(A \cup B) \ge \mu(A) + \mu(B),$

since the opposite inequality follows from finite subadditivity, as in (5.163). If
$\mu$ is a metric outer measure on $X$, then it is well known that the Borel sets in
$X$ are measurable with respect to $\mu$. This is called Carathéodory’s criterion.
Let $E$ be a nonempty proper subset of $X$ such that

(5.173)    $\operatorname{dist}(E, X \setminus E) > 0,$

which implies that $E$ is both open and closed in $X$, so that $X$ is not connected.
If $\mu$ is a metric outer measure on $X$, then it is easy to see directly that





$E$ is measurable with respect to $\mu$, which is a special case of Carathéodory’s
criterion. Note that (5.173) holds when $E$ is a nonempty proper compact open
subset of $X$. If $d(x, y)$ is an ultrametric on $X$ and $E$ is an open or closed ball
in $X$ of positive radius which is a proper subset of $X$, then $E$ satisfies (5.173).
Similarly, if $d(x, y)$ is an ultrametric on $X$ and $E$ is a proper nonempty subset
of $X$ that can be expressed as the union of a family of balls of a fixed positive
radius, then $E$ satisfies (5.173).
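
The separation property (5.173) for balls in an ultrametric space can be seen concretely in a small example. The Python sketch below assumes a finite sample of integers with the 3-adic metric, which is not one of the spaces singled out in the text and is used only for illustration, and computes the distance (5.169) between a closed ball and its complement in the sample. The helper names are hypothetical.

```python
from fractions import Fraction

def padic_dist(x, y, p):
    """Ultrametric d(x, y) = |x - y|_p on the integers."""
    n = x - y
    if n == 0:
        return Fraction(0)
    v = 0
    while n % p == 0:
        n //= p
        v += 1
    return Fraction(1, p ** v)

def closed_ball(points, center, radius, p):
    """Points of the sample within distance <= radius of center."""
    return {x for x in points if padic_dist(x, center, p) <= radius}

def dist_sets(a, b, p):
    """dist(A, B) as in (5.169), for finite nonempty sets of integers."""
    return min(padic_dist(x, y, p) for x in a for y in b)

p = 3
X = set(range(81))                       # finite sample, not all of Z_3
for radius in (Fraction(1, 3), Fraction(1, 9)):
    E = closed_ball(X, 1, radius, p)
    print(radius, len(E), dist_sets(E, X - E, p))
```

In both cases the distance to the complement is strictly larger than the radius of the ball, which is the mechanism behind (5.173) in the ultrametric case.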

An outer measure $\mu$ on a set $X$ is said to be regular if for each subset $A$ of
$X$ there is a subset $B$ of $X$ that is measurable with respect to $\mu$ and satisfies

(5.174)    $A \subseteq B$

and

(5.175)    $\mu(A) = \mu(B).$

Similarly, an outer measure $\mu$ on a topological space $X$ is said to be Borel
regular if $\mu$ is a Borel outer measure on $X$, and if for each $A \subseteq X$ there is a
Borel set $B \subseteq X$ that satisfies (5.174) and (5.175). Of course, (5.174) implies
that $\mu(A) \le \mu(B)$, as in (5.161), and so it suffices to check that the opposite
inequality holds to get (5.175). In both cases, if $\mu(A) = +\infty$, then one can
simply take $B = X$. Some texts use the term “measure” for what is called an
outer measure here, and then use the adjectives “Borel”, “regular”, and “Borel
regular” as defined here. Otherwise, the term “measure” is often used for a
countably-additive nonnegative extended-real-valued function defined on a
$\sigma$-algebra of “measurable” subsets of a set $X$, for which the measure of the empty
set is equal to 0. In this terminology, a Borel measure is a measure defined on
the $\sigma$-algebra of Borel subsets of a topological space $X$, and somewhat different
regularity properties are typically considered, especially for Borel measures on
locally compact Hausdorff topological spaces.

Let $\mu$ be an outer measure on a set $X$, and let $A_1, A_2, A_3, \ldots$ be a sequence
of subsets of $X$ such that

(5.176)    $A_j \subseteq A_{j+1}$

for each $j$. Thus

(5.177)    $\mu(A_j) \le \mu(A_{j+1}) \le \mu\Bigl(\bigcup_{l=1}^\infty A_l\Bigr)$

for each $j \ge 1$, and hence

(5.178)    $\sup_{j \ge 1} \mu(A_j) \le \mu\Bigl(\bigcup_{j=1}^\infty A_j\Bigr).$

Note that $\mu(A_j)$ tends to the supremum as $j \to \infty$, because $\mu(A_j)$ increases
monotonically in $j$, with the usual interpretations for extended real numbers.
Suppose that $A_j$ is measurable with respect to $\mu$ for each $j \ge 1$, which implies
that $A_j \setminus A_{j-1}$ is measurable with respect to $\mu$ for each $j \ge 2$. The sets $A_j \setminus A_{j-1}$





with $j \ge 2$ are pairwise disjoint, because of (5.176), and disjoint from $A_1$. We
also have that

(5.179)    $A_j = A_1 \cup \Bigl(\bigcup_{l=2}^j (A_l \setminus A_{l-1})\Bigr)$

for each $j \ge 1$, with suitable interpretations when $j = 1$, and that

(5.180)    $\bigcup_{j=1}^\infty A_j = A_1 \cup \Bigl(\bigcup_{l=2}^\infty (A_l \setminus A_{l-1})\Bigr).$

This implies that

(5.181)    $\mu(A_j) = \mu(A_1) + \sum_{l=2}^j \mu(A_l \setminus A_{l-1})$

for each $j \ge 1$, with suitable interpretations when $j = 1$, and that

(5.182)    $\mu\Bigl(\bigcup_{j=1}^\infty A_j\Bigr) = \mu(A_1) + \sum_{l=2}^\infty \mu(A_l \setminus A_{l-1}),$

because $\mu$ is countably additive on measurable sets. It follows that

(5.183)    $\lim_{j \to \infty} \mu(A_j) = \mu\Bigl(\bigcup_{j=1}^\infty A_j\Bigr).$

Of course, this is a standard fact about countably-additive measures on
$\sigma$-algebras of measurable sets.

If $\mu$ is a regular outer measure on $X$, then it is well known that (5.183) holds
even when the $A_j$’s are not asked to be measurable with respect to $\mu$. We already
have (5.178), and so the point is to show that the opposite inequality holds as
well. The regularity of $\mu$ implies that each $A_j$ is contained in a measurable
set with the same measure, and so one would like to use this to reduce to the
previous case of measurable sets. However, one should be a bit careful to choose
these measurable sets so that they are also monotonically increasing with respect
to inclusion, which is not too difficult to do. For instance, if $B_l$ is a measurable
set that contains $A_l$ and has the same measure for each $l$, then one can use the
sets $\bigcap_{l \ge j} B_l$, which contain $A_j$, are contained in $B_j$, and increase with $j$.


5.11 Hausdorff measures 

Let $(M, d(x, y))$ be a metric space, and let $\alpha$ be a nonnegative real number.
Remember that a subset of $M$ is said to be bounded if it is contained in a ball
of finite radius. The diameter of a nonempty bounded set $A \subseteq M$ is defined by

(5.184)    $\operatorname{diam} A = \sup\{d(x, y) : x, y \in A\},$

in which case

(5.185)    $(\operatorname{diam} A)^\alpha$





can be defined in the usual way for each $\alpha > 0$. Let us interpret (5.185) as being
equal to 1 when $\alpha = 0$ and $A$ is bounded and nonempty, even if $A$ has only one
element, so that $\operatorname{diam} A = 0$. If $A = \emptyset$, then we interpret (5.185) as being equal
to 0 for every $\alpha \ge 0$, and we interpret (5.185) as being equal to $+\infty$ for every
$\alpha \ge 0$ when $A$ is unbounded.

The $\alpha$-dimensional Hausdorff content $H^\alpha_{con}(E)$ of a set $E \subseteq M$ is defined to
be the infimum of the sums

(5.186)    $\sum_j (\operatorname{diam} A_j)^\alpha$

over all collections $\{A_j\}_j$ of finitely or countably many subsets of $M$ such that

(5.187)    $E \subseteq \bigcup_j A_j.$

More precisely, the sum (5.186) can be defined as a nonnegative extended real
number as in Section 1.10, and $H^\alpha_{con}(E)$ is also a nonnegative extended real
number. Note that

(5.188)    $H^\alpha_{con}(E) \le (\operatorname{diam} E)^\alpha$

for every $E \subseteq M$, since one can use the covering of $E$ by itself in the previous
definition. It is not difficult to verify that $H^\alpha_{con}$ is an outer measure on $M$, by
standard arguments. Basically, $H^\alpha_{con}$ is the largest possible outer measure on
$M$ that satisfies (5.188).

Similarly, $H^\alpha_\delta(E)$ is defined for $0 < \delta \le +\infty$ as the infimum of the sums
(5.186) over all collections $\{A_j\}_j$ of finitely or countably many subsets of $M$
that satisfy (5.187) and

(5.189)    $\operatorname{diam} A_j < \delta$

for each $j$, when there are such coverings of $E$ in $M$. Otherwise, if there is no
such covering of $E$ in $M$, then we simply put $H^\alpha_\delta(E) = +\infty$. Note that

(5.190)    $H^\alpha_\infty(E) = H^\alpha_{con}(E)$

for every $E \subseteq M$, and that $H^\alpha_\delta(E)$ decreases monotonically in $\delta$. One can also
check that $H^\alpha_\delta$ is an outer measure on $M$ for every $\delta > 0$, as before.
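
To see how the sums (5.186) behave under the restriction (5.189), consider the unit interval $[0, 1]$ in the real line, covered by $n$ closed intervals of length $1/n$. This example and the function name are assumptions made for illustration, and $\alpha$ is restricted to nonnegative integers so that the arithmetic is exact. Each sum is an upper bound for the corresponding $\alpha$-dimensional quantity of $[0, 1]$ at any scale $\delta > 1/n$.

```python
from fractions import Fraction

def covering_sum(n, alpha):
    """Sum (5.186) for the covering of [0, 1] by the n intervals
    [i/n, (i+1)/n], i = 0, ..., n-1, each of diameter 1/n.
    alpha is taken to be a nonnegative integer so the arithmetic is exact."""
    diam = Fraction(1, n)
    return sum(diam ** alpha for _ in range(n))

for alpha in (0, 1, 2):
    print(alpha, [covering_sum(n, alpha) for n in (1, 2, 4, 8)])
```

The sums grow with $n$ when $\alpha = 0$, stay constant when $\alpha = 1$, and tend to 0 when $\alpha = 2$, so that the infimum over fine coverings vanishes in the last case.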

Remember that M is said to be separable if there is a dense subset of M with 
only finitely or countably many elements. This implies that M can be covered 
by finitely or countably many subsets with diameter less than $\delta$ for each $\delta > 0$,
and in fact this condition is equivalent to separability. It follows that every 
subset of M can be covered in this way when M is separable. 

The $\alpha$-dimensional Hausdorff measure of $E \subseteq M$ is defined by

(5.191)    $H^\alpha(E) = \sup_{\delta > 0} H^\alpha_\delta(E),$

which can also be considered as the limit of $H^\alpha_\delta(E)$ as $\delta \to 0$, because of the
monotonicity of $H^\alpha_\delta(E)$ in $\delta$. It is easy to see that $H^\alpha$ is an outer measure on
$M$, since $H^\alpha_\delta$ is an outer measure on $M$ for each $\delta > 0$. If $H^\alpha(E) = 0$ for





some $E \subseteq M$, then $H^\alpha_\delta(E) = 0$ for each $\delta > 0$, and in particular $H^\alpha_{con}(E) = 0$.
Conversely, if $H^\alpha_{con}(E) = 0$, then the coverings of $E$ for which the sums
(5.186) are small involve subsets $A_j$ of $M$ with small diameter. This implies
that $H^\alpha_\delta(E) = 0$ for every $\delta > 0$, and hence that $H^\alpha(E) = 0$.

Suppose that $E_1$ and $E_2$ are nonempty subsets of $M$ such that

(5.192)    $d(x, y) \ge \eta$

for some $\eta > 0$ and every $x \in E_1$ and $y \in E_2$. If $A$ is any subset of $M$ with
diameter less than $\eta$, then it follows that $A$ cannot intersect both $E_1$ and $E_2$.
Using this, one can check that

(5.193)    $H^\alpha_\delta(E_1 \cup E_2) \ge H^\alpha_\delta(E_1) + H^\alpha_\delta(E_2)$

when $0 < \delta \le \eta$. More precisely, if $\delta \le \eta$, then any covering of $E_1 \cup E_2$
by finitely or countably many subsets of $M$ with diameter less than $\delta$ can be
split into coverings of $E_1$ and $E_2$ separately. This leads to a splitting of the
corresponding sums (5.186), which can be used to obtain (5.193). Under these
conditions, we get that

(5.194)    $H^\alpha(E_1 \cup E_2) \ge H^\alpha(E_1) + H^\alpha(E_2),$

by taking the limit as $\delta \to 0$. Thus $H^\alpha$ satisfies Carathéodory’s criterion, which
implies that Borel subsets of $M$ are measurable with respect to $H^\alpha$.

One can check that $H^\alpha$ reduces to counting measure on $M$ when $\alpha = 0$, using
the conventions for defining (5.185) when $\alpha = 0$ mentioned at the beginning of
the section. In particular, to get $H^0(\emptyset) = 0$, one can cover the empty set by
itself. Otherwise, one can let the empty set be covered by the empty collection
of subsets of $M$, and interpret an empty sum as being equal to 0, as before.

It is easy to see that the diameter of a set $A \subseteq M$ is equal to the diameter of
the closure $\overline{A}$ of $A$ in $M$. One can also show that every set $A \subseteq M$ is contained
in open subsets of $M$ with diameter arbitrarily close to the diameter of $A$. This
implies that one can restrict one’s attention to coverings of a set $E \subseteq M$ by
open or closed subsets of $M$ in the definitions of $H^\alpha_{con}(E)$ and $H^\alpha_\delta(E)$, and get
the same result as before. If $E$ is compact, then it follows that one can restrict
one’s attention to coverings of $E$ by finitely many subsets of $M$ in the definitions
of $H^\alpha_{con}(E)$ and $H^\alpha_\delta(E)$.

Note that $H^\alpha_\delta$ is often defined using the condition

(5.195)    $\operatorname{diam} A_j \le \delta$

instead of (5.189), which leads to an equivalent definition of $H^\alpha$. An advantage
of using (5.189) is that one can more easily restrict one’s attention to coverings
by open subsets of $M$, as in the previous paragraph. Otherwise, if one uses
(5.195) and restricts one’s attention to coverings by open sets, then one gets the
same result for $H^\alpha$ in the limit as $\delta \to 0$, but not necessarily for each $\delta > 0$
individually.





If $M = \mathbf{R}$ with the standard metric, then one can restrict one’s attention to
coverings of $E \subseteq \mathbf{R}$ by intervals in the definitions of $H^\alpha_{con}(E)$ and $H^\alpha_\delta(E)$. This
is especially nice when $\alpha = 1$, for which one can check that $H^1_\delta = H^1_{con}$ for each
$\delta > 0$, by subdividing intervals into smaller pieces. It follows that $H^1 = H^1_{con}$
on $\mathbf{R}$, which is the same as Lebesgue outer measure on $\mathbf{R}$. If $\alpha = 0$, then it
is helpful to consider the empty set as an interval in $\mathbf{R}$, to deal with the case
where $E = \emptyset$. Otherwise, one can avoid the problem by allowing the empty set
to be covered by the empty collection of subsets of $\mathbf{R}$.

If $A$ is a bounded subset of any (nonempty) metric space $M$, then $A$ is
contained in a closed ball in $M$ with radius equal to the diameter of $A$. Any
closed ball $B$ in $M$ with radius $r$ has diameter less than or equal to $2r$, by the
triangle inequality. If $d(x, y)$ is an ultrametric on $M$, then the diameter of a
closed ball in $M$ of radius $r$ is less than or equal to $r$. In this case, it follows that
one can restrict one’s attention to coverings of a set $E \subseteq M$ by closed balls in
$M$ in the definitions of $H^\alpha_{con}(E)$ and $H^\alpha_\delta(E)$, and get the same result as before,
at least if $\alpha > 0$. If $\alpha = 0$, then one should allow the empty set to be covered
by itself, or by the empty collection of subsets of $M$, as usual.

Let $(M, d(x, y))$ be an arbitrary metric space again, and let $Y$ be a subset of
$M$. Of course, $Y$ can also be considered as a metric space, using the restriction
of $d(x, y)$ to $x, y \in Y$. If $E \subseteq Y$, then one can restrict one’s attention to
coverings of $E$ by subsets of $Y$ in the definition of $H^\alpha_{con}(E)$ and $H^\alpha_\delta(E)$, and get
the same result as for $E$ as a subset of $M$. More precisely, every covering of $E$
in $Y$ can be considered as a covering of $E$ in $M$, and every covering of $E$ in $M$
leads to a covering of $E$ in $Y$, by taking the intersections of the subsets of $M$
in the covering of $E$ with $Y$. It follows that the definitions of $H^\alpha_{con}(E)$, $H^\alpha_\delta(E)$,
and $H^\alpha(E)$ for $E$ as a subset of $Y$ are equivalent to the analogous definitions
for $E$ as a subset of $M$.

Let $E$ be a subset of $M$ such that $H^\alpha_\delta(E) < \infty$ for some $\delta > 0$. By definition
of $H^\alpha_\delta(E)$, for each $\epsilon > 0$ there is a collection $\{A_j\}_j$ of finitely or countably many
subsets of $M$ such that $E$ is contained in the union of the $A_j$’s, the diameter of
$A_j$ is less than $\delta$ for each $j$, and

(5.196)    $\sum_j (\operatorname{diam} A_j)^\alpha < H^\alpha_\delta(E) + \epsilon.$

As before, we can also choose the $A_j$’s to be open subsets of $M$, so that

(5.197)    $U = U(\epsilon, \delta) = \bigcup_j A_j$

is an open set in $M$ as well. By construction,

(5.198)    $H^\alpha_\delta(U) \le \sum_j (\operatorname{diam} A_j)^\alpha,$

since the $A_j$’s can be used to cover $U$ in the definition of $H^\alpha_\delta(U)$ too.





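Suppose now that $H^\alpha(E) < \infty$, so that $H^\alpha_\delta(E) < \infty$ for each $\delta > 0$. Thus
we can apply the remarks in the previous paragraph to $\epsilon = \delta = 1/n$ for each
positive integer $n$, to get an open set $U_n \subseteq M$ such that $E \subseteq U_n$ and

(5.199)    $H^\alpha_{1/n}(U_n) < H^\alpha_{1/n}(E) + 1/n \le H^\alpha(E) + 1/n.$

Put

(5.200)    $\widetilde{E} = \bigcap_{n=1}^\infty U_n,$

so that $\widetilde{E}$ is a $G_\delta$ set in $M$, and hence a Borel set, and $E \subseteq \widetilde{E}$. Of course,
$\widetilde{E} \subseteq U_n$ for each $n$, which implies that

(5.201)    $H^\alpha_{1/n}(\widetilde{E}) \le H^\alpha_{1/n}(U_n)$

for each $n$. Combining this with (5.199) and taking the limit as $n \to \infty$, we get
that

(5.202)    $H^\alpha(\widetilde{E}) \le H^\alpha(E).$

It follows that

(5.203)    $H^\alpha(\widetilde{E}) = H^\alpha(E),$

since the opposite inequality holds automatically, because $E \subseteq \widetilde{E}$. This shows
that $H^\alpha$ is Borel regular as an outer measure on $M$.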

5.12 Hausdorff measures, continued 

Let $k$ be a field, and let $|\cdot|$ be an absolute value function on $k$ which is nontrivial
and discrete. As in Section 1.9, this implies that there is a real number $\rho_1$ such
that $0 < \rho_1 < 1$ and the positive values of $|\cdot|$ on $k$ are the same as the integer
powers of $\rho_1$. Remember that every closed ball in $k$ of radius $r > 0$ with respect
to the ultrametric associated to $|\cdot|$ has diameter less than or equal to $r$, as
mentioned in the previous section. In this situation, if $r$ is an integer power of
$\rho_1$, then the diameter of a closed ball in $k$ of radius $r$ is equal to $r$. Of course,
every closed ball in $k$ of positive radius is the same as a closed ball with radius
equal to an integer power of $\rho_1$.

Suppose in addition that the residue field associated to $|\cdot|$ on $k$ as in Section
3.10 has exactly $N$ elements for some integer $N \ge 2$. Let $\alpha$ be the real number
determined by

(5.204)    $\rho_1^\alpha = 1/N,$

and observe that $\alpha > 0$ under these conditions. Also let $H^\alpha_{con}$, $H^\alpha_\delta$, and $H^\alpha$ be
the outer measures on $k$ corresponding to the ultrametric on $k$ associated to $|\cdot|$
as in the previous section. If $j$, $l$ are integers and $l \ge 0$, then every closed ball
in $k$ of radius $\rho_1^j$ can be expressed as the union of $N^l$ pairwise-disjoint closed
balls of radius $\rho_1^{j+l}$, as in Section 3.10 again. This implies that

(5.205)    $H^\alpha_\delta(\overline{B}(x, \rho_1^j)) \le N^l \, (\rho_1^{j+l})^\alpha = \rho_1^{j\alpha}$





for every $x \in k$ when $\rho_1^{j+l} < \delta$, using the definition of $H^\alpha_\delta$ in the first step, and
(5.204) in the second step. It follows that

(5.206)    $H^\alpha(\overline{B}(x, \rho_1^j)) \le \rho_1^{j\alpha}$

for every $x \in k$ and $j \in \mathbf{Z}$, by taking the supremum of the left side of (5.205)
over $\delta > 0$. Let us check that

(5.207)    $H^\alpha_\delta(E) = H^\alpha_{con}(E)$

for every $E \subseteq k$ and $\delta > 0$, so that

(5.208)    $H^\alpha(E) = H^\alpha_{con}(E).$

Of course, $H^\alpha_{con}(E)$ is automatically less than or equal to $H^\alpha_\delta(E)$ for each $\delta > 0$,
and so it suffices to verify the opposite inequality. One way to do this is to
observe that any covering of $E$ by finitely or countably many closed balls can
be replaced by a covering of $E$ by closed balls with arbitrarily small radius, by
covering each ball by balls of smaller radius, as before. This keeps the same
sum as in (5.186), because of the way that $\alpha$ was chosen. Alternatively, one can
use the definition of $H^\alpha_{con}(E)$, (5.205), and countable subadditivity of $H^\alpha_\delta$.
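
The bookkeeping behind (5.205) is that refining a ball into $N^l$ smaller balls does not change the sum (5.186) when $\alpha$ is chosen as in (5.204). The Python sketch below checks this identity exactly, representing $\rho_1^\alpha$ by the fraction $1/N$. The last function additionally assumes the $p$-adic numbers, where $N = p$ and $\rho_1 = 1/p$, to count the smaller balls directly; all function names are hypothetical.

```python
from fractions import Fraction

def refined_sum(N, j, l):
    """N**l * (rho1**(j + l))**alpha, using rho1**alpha = 1/N from (5.204)."""
    return N ** l * Fraction(1, N) ** (j + l)

def coarse_value(N, j):
    """rho1**(j * alpha) = (1/N)**j."""
    return Fraction(1, N) ** j

def count_subballs(p, j, l):
    """Assuming k = Q_p: residues mod p**(j+l) lying in p**j * Z_p, j >= 0.
    Each one labels a closed ball of radius p**(-(j+l)) inside the closed
    ball of radius p**(-j) about 0."""
    return sum(1 for a in range(p ** (j + l)) if a % p ** j == 0)

print(all(refined_sum(N, j, l) == coarse_value(N, j)
          for N in (2, 3, 5) for j in range(-3, 4) for l in range(6)))
print(count_subballs(3, 2, 3), 3 ** 3)
```

The same identity is what keeps the sums unchanged when coverings by balls are replaced by coverings by smaller balls.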

Let us now ask that $k$ be complete with respect to the ultrametric associated
to $|\cdot|$, in addition to the hypotheses mentioned earlier. It follows that closed
balls in $k$ are compact, because closed totally bounded subsets of a complete
metric space are compact. In this case, it is not too difficult to show that

(5.209)    $H^\alpha_{con}(\overline{B}(x, \rho_1^j)) \ge \rho_1^{j\alpha}$

for every $x \in k$ and $j \in \mathbf{Z}$. More precisely, because $\overline{B}(x, \rho_1^j)$ is compact, it
suffices to consider coverings of $\overline{B}(x, \rho_1^j)$ by finitely many open subsets of $k$ in
the definition of $H^\alpha_{con}(\overline{B}(x, \rho_1^j))$, as in the previous section. In fact, it suffices
to consider coverings of $\overline{B}(x, \rho_1^j)$ by finitely many closed balls of positive radius,
because the metric on $k$ is an ultrametric. In the present situation, one can
reduce further to coverings of $\overline{B}(x, \rho_1^j)$ by finitely many closed balls of the same
radius $\rho_1^{j+l}$ for some $l \in \mathbf{Z}_+$, since one can cover balls of arbitrary radius by
balls with smaller radius as before. This also uses the way that $\alpha$ was chosen,
to ensure that sums like (5.186) are maintained when one reduces to covering
by balls of smaller radius. However, one can check that $\overline{B}(x, \rho_1^j)$ cannot be
covered by fewer than $N^l$ closed balls of radius $\rho_1^{j+l}$ for any $l \in \mathbf{Z}_+$ under these
conditions. This implies (5.209), as desired.

Combining (5.206) and (5.209), we get that

(5.210)    $H^\alpha(\overline{B}(x, \rho_1^j)) = \rho_1^{j\alpha}$

for every $x \in k$ and $j \in \mathbf{Z}$. Of course, Hausdorff measure of any dimension is
invariant under isometries on any metric space, by construction. In particular,
Hausdorff measure of any dimension is invariant under translations on $k$. This
implies that $H^\alpha$ satisfies the requirements of Haar measure on $k$, because $H^\alpha$
is finite on bounded subsets of $k$, and positive on nonempty open subsets of $k$,
by (5.210).





5.13 Lipschitz mappings, revisited 

Let $(M_1, d_1(x, y))$ and $(M_2, d_2(u, v))$ be metric spaces, and suppose that $f$ is a
Lipschitz mapping of order $a > 0$ from $M_1$ into $M_2$ with constant $C$. If $A$ is a
bounded subset of $M_1$, then $f(A)$ is a bounded subset of $M_2$, and

(5.211)    $\operatorname{diam} f(A) \le C \, (\operatorname{diam} A)^a,$

where the diameters in (5.211) are taken in the appropriate metric space. Using
this, one can check that

(5.212)    $H^\alpha_{con}(f(E)) \le C^\alpha \, H^{a\alpha}_{con}(E)$

for every $E \subseteq M_1$ and $\alpha \ge 0$, where the Hausdorff content on each side is taken
in the appropriate metric space. More precisely, $C^\alpha$ should be interpreted as
being equal to 1 for every $C \ge 0$ when $\alpha = 0$. If $\alpha > 0$ and $C = 0$, then the right
side of (5.212) may be interpreted as being equal to 0, even when $H^{a\alpha}_{con}(E) = \infty$.
Let $\delta > 0$ be given, and suppose that

(5.213)    $\delta' > 0$ and $\delta' \ge C \, \delta^a.$

In analogy with (5.212), we have that

(5.214)    $H^\alpha_{\delta'}(f(E)) \le C^\alpha \, H^{a\alpha}_\delta(E)$

for every $E \subseteq M_1$ and $\alpha \ge 0$, and with the same conventions as before. If
$C > 0$, then one might as well take

(5.215)    $\delta' = C \, \delta^a,$

and otherwise one can take any $\delta' > 0$ when $C = 0$. It follows that

(5.216)    $H^\alpha(f(E)) \le C^\alpha \, H^{a\alpha}(E)$

for every $E \subseteq M_1$ and $\alpha \ge 0$, since $\delta > 0$ is arbitrary.
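
The diameter estimate (5.211) is easy to test on a finite set of points. The Python sketch below assumes the square-root function on the nonnegative real numbers, which is Lipschitz of order $1/2$ with constant 1 because $|\sqrt{x} - \sqrt{y}|^2 \le |x - y|$. This example and the helper names are assumptions made for illustration and do not come from the text.

```python
import math
from itertools import combinations

def diam(points):
    """Diameter (5.184) of a finite nonempty set of real numbers."""
    return max((abs(x - y) for x, y in combinations(points, 2)), default=0.0)

def holder_constant(points, f, a):
    """Smallest C with |f(x) - f(y)| <= C * |x - y|**a on distinct points."""
    return max((abs(f(x) - f(y)) / abs(x - y) ** a
                for x, y in combinations(points, 2)), default=0.0)

A = [1.0, 2.0, 4.0, 9.0]
C, a = 1.0, 0.5                        # sqrt is Lipschitz of order 1/2, C = 1
lhs = diam([math.sqrt(x) for x in A])  # diam f(A)
rhs = C * diam(A) ** a                 # C * (diam A)**a
print(lhs, rhs, holder_constant(A, math.sqrt, a))
```

Applying the same estimate to each set in a covering of $E$ gives a covering of $f(E)$, which is how (5.211) leads to (5.212) and (5.214).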

Suppose now that

(5.217)    $C^{-1} \, d_1(x, y)^a \le d_2(f(x), f(y)) \le C \, d_1(x, y)^a$

for some $a > 0$ and $C \ge 1$, and for every $x, y \in M_1$. This implies that

(5.218)    $C^{-1} \, (\operatorname{diam} A)^a \le \operatorname{diam} f(A) \le C \, (\operatorname{diam} A)^a$

for every $A \subseteq M_1$, as in (5.211). As in Section 4.4, the inverse of $f$ is Lipschitz
of order $1/a$ with constant $C^{1/a}$ as a mapping from $f(M_1)$ into $M_1$, because of
the first inequality in (5.217). Remember that the Hausdorff content of $f(E)$
as a subset of $M_2$ is the same as the Hausdorff content of $f(E)$ as a subset of
$f(M_1)$, as in Section 5.11, because $f(E) \subseteq f(M_1) \subseteq M_2$. Applying (5.212) to
the inverse of $f$, we get that

(5.219)    $H^{a\alpha}_{con}(E) \le (C^{1/a})^{a\alpha} \, H^{(a\alpha)/a}_{con}(f(E)) = C^\alpha \, H^\alpha_{con}(f(E))$





for every $E \subseteq M_1$ and $\alpha \ge 0$. More precisely, we are applying (5.212) to $f(E)$
instead of $E$, to $a\alpha$ instead of $\alpha$, to $1/a$ instead of $a$, and to $C^{1/a}$ instead of $C$.
It follows that

(5.220)    $C^{-\alpha} \, H^{a\alpha}_{con}(E) \le H^\alpha_{con}(f(E)) \le C^\alpha \, H^{a\alpha}_{con}(E)$

for every $E \subseteq M_1$ and $\alpha \ge 0$, where the first inequality is equivalent to (5.219),
and the second inequality is the same as (5.212).

Similarly, let $\delta_2 > 0$ be given, and put

(5.221)    $\delta_1 = C^{1/a} \, \delta_2^{1/a},$

which is the same as saying that

(5.222)    $\delta_2 = C^{-1} \, \delta_1^a.$

One can check that

(5.223)    $H^{a\alpha}_{\delta_1}(E) \le (C^{1/a})^{a\alpha} \, H^{(a\alpha)/a}_{\delta_2}(f(E)) = C^\alpha \, H^\alpha_{\delta_2}(f(E))$

for every $E \subseteq M_1$ and $\alpha \ge 0$, by applying (5.214) to the inverse of $f$, with $\delta_2$,
$\delta_1$ in place of $\delta$, $\delta'$, respectively. As before, we are also applying (5.214) to $f(E)$
instead of $E$, to $a\alpha$ instead of $\alpha$, to $1/a$ instead of $a$, and to $C^{1/a}$ instead of $C$.
This implies that

(5.224)    $C^{-\alpha} \, H^{a\alpha}(E) \le H^\alpha(f(E))$

for every $E \subseteq M_1$ and $\alpha \ge 0$, which could be derived from (5.216) applied to
the inverse of $f$ as well, with the same substitutions as before. It follows that

(5.225)    $C^{-\alpha} \, H^{a\alpha}(E) \le H^\alpha(f(E)) \le C^\alpha \, H^{a\alpha}(E)$

for every $E \subseteq M_1$ and $\alpha \ge 0$, using (5.224) in the first step, and (5.216) in the
second step.

5.14 Local Lipschitz conditions 

Let $(M_1, d_1(x, y))$, $(M_2, d_2(u, v))$ be metric spaces again, let $a$, $\eta$ be positive real
numbers, and let $C$ be a nonnegative real number. A mapping $f : M_1 \to M_2$ is
said to be locally Lipschitz of order $a$ at the scale of $\eta$ with constant $C$ if

(5.226)    $d_2(f(x), f(y)) \le C \, d_1(x, y)^a$

for every $x, y \in M_1$ with

(5.227)    $d_1(x, y) \le \eta.$

Alternatively, one might prefer to ask that (5.226) hold for every $x, y \in M_1$ with

(5.228)    $d_1(x, y) < \eta$




instead of (5.227). This would imply that (5.226) holds for all $x, y \in M_1$ that
can be approximated by elements of $M_1$ at distance strictly less than $\eta$, by
continuity. If $x, y \in M_1$ can be approximated by elements of $M_1$ at distance
strictly less than $\eta$, then $x$, $y$ satisfy (5.227), but the converse does not always
hold.
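
The role of the scale $\eta$, and the difference between the conditions (5.227) and (5.228), can be seen on a finite set of points. The Python sketch below takes $a = 1$ and assumes a hypothetical example on the real line, namely the integer-part function on two clusters of rational points; the function name and the data are illustrative only.

```python
from fractions import Fraction as F
from math import floor

def locally_lipschitz(points, f, C, eta, strict=False):
    """Check (5.226) with a = 1 on a finite set of rationals, for all pairs
    with d(x, y) <= eta as in (5.227), or d(x, y) < eta as in (5.228)."""
    for x in points:
        for y in points:
            d1 = abs(x - y)
            close = d1 < eta if strict else d1 <= eta
            if close and abs(f(x) - f(y)) > C * d1:
                return False
    return True

clusters = [F(0), F(1, 10), F(1, 5), F(1), F(11, 10), F(6, 5)]
print(locally_lipschitz(clusters, floor, 0, F(1, 2)))        # small scale
print(locally_lipschitz(clusters, floor, 0, F(2)))           # large scale
print(locally_lipschitz([F(0), F(1)], floor, 0, 1),
      locally_lipschitz([F(0), F(1)], floor, 0, 1, strict=True))
```

A function that is constant on well-separated clusters satisfies the local condition with $C = 0$ at small scales but not at large ones, and the two-point example separates the strict and non-strict versions of the condition.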

Suppose that / : Mi -£ M 2 is locally Lipschitz of order a > 0 at the scale of 
77 > 0 and with constant C > 0. It is easy to see that (5.211) still holds for any 
bounded set A C Mi with 

(5.229) diam^ < 77 . 

If 

(5.230) 0 < (5 < 77 

and S' satisfies (5.213), then (5.214) still holds for every E C Mi and a > 0, for 
essentially the same reasons as before. This implies that (5.216) holds for every 
E C Ml and a > 0, basically by taking <5 and S' arbitrarily small, as before. 
If the local Lipschitz condition is defined in terms of (5.228) instead of (5.227), 
then (5.211) holds when 

(5.231) $\operatorname{diam} A < \eta$,

instead of (5.229). In this case, (5.214) still holds for every $E \subseteq M_1$ and $\alpha > 0$ 
when $\delta$ and $\delta'$ satisfy (5.230) and (5.213), because of the strict inequality in 
(5.189) in Section 5.11. If one were to also use the non-strict inequality (5.195) 
instead of the strict inequality (5.189) in the definition of these outer measures, 
then one should ask that 

(5.232) $0 < \delta < \eta$

instead of (5.230), in order to get (5.214). In each variant, one still gets (5.216) 
for every $E \subseteq M_1$ and $\alpha > 0$, by taking $\delta$ and $\delta'$ to be arbitrarily small. 
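
The covering argument behind these statements can be sketched as follows, for the version that uses (5.227), (5.229), and (5.230), with strict inequalities in the coverings. This is only a sketch under stated assumptions: it supposes that (5.211) is the estimate $\operatorname{diam} f(A) \le C\, (\operatorname{diam} A)^a$, that (5.213) asks for $\delta' \ge C\, \delta^a$, and that (5.214) is $H^{\alpha}_{\delta'}(f(E)) \le C^{\alpha}\, H^{a \alpha}_{\delta}(E)$. These forms are assumptions here, because the statements themselves belong to the preceding section, and $C > 0$ is assumed for the strict inequality in the second step.

```latex
% Step 1: if diam A <= eta, then every pair x, y in A satisfies
% d_1(x, y) <= diam A <= eta, so (5.226) applies to x and y.
\[
  d_2(f(x), f(y)) \le C\, d_1(x, y)^a \le C\, (\operatorname{diam} A)^a
  \quad \text{for all } x, y \in A
  \quad \Longrightarrow \quad
  \operatorname{diam} f(A) \le C\, (\operatorname{diam} A)^a .
\]
% Step 2: let E be covered by sets A_j with diam A_j < delta <= eta.
% Then the images f(A_j) cover f(E) and are admissible at scale delta'.
\[
  f(E) \subseteq \bigcup_j f(A_j), \qquad
  \operatorname{diam} f(A_j) \le C\, (\operatorname{diam} A_j)^a < C\, \delta^a \le \delta' .
\]
% Step 3: compare the sums that appear in the two outer measures.
\[
  \sum_j (\operatorname{diam} f(A_j))^{\alpha}
  \le C^{\alpha} \sum_j (\operatorname{diam} A_j)^{a \alpha} .
\]
```

Taking the infimum over all such coverings of $E$ gives the assumed form of (5.214). If (5.228) is used instead, then Step 1 needs $\operatorname{diam} A < \eta$, which is why the choice between strict and non-strict inequalities for $\delta$ matters.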

Let us now restrict our attention to $a = 1$, in which case we may simply say 
that a mapping is locally Lipschitz at the scale of $\eta$ with constant $C$. Let $f$ be 
a mapping from $M_1$ into $M_2$ again, and let $x$ be an element of $M_1$. Using the 
notation in Section 4.2, we have that 

(5.233) $D_r(f)(x) \le C$

for $0 < r \le \eta$ if and only if 

(5.234) $\widetilde{D}_\eta(f)(x) \le C$,

which happens if and only if (5.226) holds for every $y \in M_1$ that satisfies (5.227). 
In particular, $f$ is locally Lipschitz at the scale of $\eta$ with constant $C$ on $M_1$ if 
and only if (5.233) holds for every $x \in M_1$ and $0 < r \le \eta$, which is the same as 
saying that (5.234) holds for every $x \in M_1$. It follows that the restriction of $f$ 
to 

(5.235) $\{x \in M_1 : \widetilde{D}_\eta(f)(x) \le C\}$

is automatically locally Lipschitz at the scale of $\eta$ with constant $C$. If one defines 
local Lipschitz conditions in terms of (5.228) instead of (5.227), then it is better 
to consider $0 < r < \eta$ in (5.233), and 


(5.236) $\widetilde{D}_t(f)(x) \le C$





for $0 < t < \eta$ instead of (5.234). Alternatively, one could modify the definition 
(4.14) of $D_r(f)(x)$, by taking the supremum over $y \in M_1$ with $d_1(x, y) < r$. 
One could still define $\widetilde{D}_t(f)(x)$ as in (4.15), using the modified definition of 
$D_r(f)(x)$, or by taking the supremum over $0 < r < t$ of either definition of 
$D_r(f)(x)$. In the second characterization (4.16) of $\widetilde{D}_t(f)(x)$, one should then 
take the supremum over $y \in M_1$ such that $0 < d_1(x, y) < t$, when there is 
such a point $y$. This second characterization of $\widetilde{D}_t(f)(x)$ corresponds to taking 
$r > d_1(x, y)$ with $r$ close to $d_1(x, y)$ in the definition of $\widetilde{D}_t(f)(x)$, instead of 
$r = d_1(x, y)$, as before. 
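
The equivalence between (5.233) for $0 < r \le \eta$ and (5.234) can be checked numerically on a finite metric space. The sketch below is not from the text. It assumes that (4.14) defines $D_r(f)(x)$ as $r^{-1}$ times the supremum of $d_2(f(x), f(y))$ over $y$ with $d_1(x, y) \le r$, and that (4.16) describes $\widetilde{D}_t(f)(x)$ as the supremum of $d_2(f(x), f(y)) / d_1(x, y)$ over $y$ with $0 < d_1(x, y) \le t$. The function names `D_r` and `D_tilde`, the sample points, and the map $y \mapsto y^2$ are hypothetical choices.

```python
from fractions import Fraction

def D_r(points, d1, d2, f, x, r):
    """Assumed form of (4.14): sup of d2(f(x), f(y)) over d1(x, y) <= r, divided by r."""
    return max(d2(f(x), f(y)) for y in points if d1(x, y) <= r) / r

def D_tilde(points, d1, d2, f, x, t):
    """Assumed form of (4.16): sup of difference quotients over 0 < d1(x, y) <= t."""
    quotients = [d2(f(x), f(y)) / d1(x, y) for y in points if 0 < d1(x, y) <= t]
    return max(quotients, default=Fraction(0))

# Hypothetical example: M_1 = {0, 1/4, ..., 2} in the real line, f(y) = y^2.
points = [Fraction(j, 4) for j in range(9)]
dist = lambda u, v: abs(u - v)
square = lambda y: y * y
x, eta = Fraction(1), Fraction(1, 2)

# On a finite set it suffices to test r at the distances from x that occur,
# because r -> D_r(f)(x) decreases between consecutive distance values.
radii = sorted({dist(x, y) for y in points if 0 < dist(x, y) <= eta})
sup_D_r = max(D_r(points, dist, dist, square, x, r) for r in radii)
D_eta = D_tilde(points, dist, dist, square, x, eta)

# (5.226) at x, with a = 1 and C equal to the value just computed.
lipschitz_at_x = all(
    dist(square(x), square(y)) <= D_eta * dist(x, y)
    for y in points if dist(x, y) <= eta
)
```

Here `sup_D_r` and `D_eta` agree, and the common value is the smallest constant $C$ for which (5.226) holds at $x$ for all $y$ satisfying (5.227).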

Note that 

(5.237) $D(f)(x) < C'$

for some $x \in M_1$ and $C' > 0$ if and only if there is a $t > 0$ such that 

(5.238) $\widetilde{D}_t(f)(x) < C'$,

by the definition (4.17) of $D(f)(x)$. Thus 

(5.239) $\{x \in M_1 : D(f)(x) < C'\}$

is the same as the union of 

(5.240) $\{x \in M_1 : \widetilde{D}_t(f)(x) < C'\}$

over $t > 0$. More precisely, it suffices to use a sequence of positive real numbers 
$t$ that converges to $0$. One can also check that $D(f)(x)$ would not be affected by 
modifying the definitions of $D_r(f)(x)$ or $\widetilde{D}_t(f)(x)$ as in the preceding paragraph, 
using strict inequalities instead of non-strict inequalities in the relevant suprema. 
This would correspond to slightly different interpretations of (5.238) and (5.240). 
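
The equivalence of (5.237) and (5.238) can be spelled out briefly. This is a sketch under the assumption that (4.17) defines $D(f)(x)$ as the infimum over $t > 0$ of $\widetilde{D}_t(f)(x)$, equivalently the limit as $t \to 0+$, since $\widetilde{D}_t(f)(x)$ increases monotonically in $t$. That form of (4.17) is an assumption here, as Section 4.2 is not reproduced.

```latex
% An infimum is strictly less than C' exactly when some term is.
\[
  D(f)(x) = \inf_{t > 0} \widetilde{D}_t(f)(x) < C'
  \iff
  \widetilde{D}_t(f)(x) < C' \ \text{ for some } t > 0 .
\]
% Monotonicity: if 0 < s <= t, then \widetilde{D}_s(f)(x) <= \widetilde{D}_t(f)(x),
% so the sets in (5.240) increase as t decreases, and a sequence suffices.
\[
  \{x \in M_1 : D(f)(x) < C'\}
  = \bigcup_{j = 1}^{\infty} \{x \in M_1 : \widetilde{D}_{t_j}(f)(x) < C'\}
  \quad \text{whenever } t_j > 0 \text{ and } t_j \to 0 .
\]
```

The strict inequality matters in the first line: with $\le$ in place of $<$, the infimum could equal $C'$ without any single $\widetilde{D}_t(f)(x)$ being at most $C'$.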



Bibliography 


[1] G. Birkhoff and S. Mac Lane, A Survey of Modern Algebra, 4th edition, 
Macmillan, 1977. 

[2] J. Cassels, Local Fields, Cambridge University Press, 1986. 

[3] R. Coifman and G. Weiss, Analyse Harmonique Non-Commutative sur Cer¬ 
tains Espaces Homoqenes, Lecture Notes in Mathematics 242, Springer- 
Verlag, 1971. 

[4] R. Coifman and G. Weiss, Extensions of Hardy spaces and their use in 
analysis. Bulletin of the American Mathematical Society 83 (1977), 569- 
645. 

[5] G. David and S. Semmes, Fractured Fractals and Broken Dreams: Self- 
Similar Ceometry through Metric and Measure, Oxford University Press, 
1997. 

[6] L. Evans and R. Gariepy, Measure Theory and Fine Properties of Functions, 
CRC Press, 1992. 

[7] K. Falconer, The Geometry of Fractal Sets, Cambridge University Press, 
1986. 

[8] K. Falconer, Fractal Geometry: Mathematical Foundations and Applica¬ 
tions, 3rd edition, Wiley, 2014. 

[9] H. Federer, Geometric Measure Theory, Springer-Verlag, 1969. 

[10] G. Folland, A Course in Abstract Harmonic Analysis, CRC Press, 1995. 

[11] G. Folland, Real Analysis, 2nd edition, Wiley, 1999. 

[12] F. Gouvea, p-Adic Numbers: An Introduction, 2nd edition, Springer-Verlag, 
1997. 

[13] J. Heinonen, Lectures on Analysis on Metric Spaces, Springer-Verlag, 2001. 

[14] E. Hewitt and K. Ross, Abstract Harmonic Analysis, Volumes I, II, 
Springer-Verlag, 1970, 1979. 


129 



130 


BIBLIOGRAPHY 


[15] E. Hewitt and K. Stromberg, Real and Abstract Analysis, Springer-Verlag, 

1975. 

[16] 1. Kaplansky, Set Theory and Metric Spaces, 2nd edition, Chelsea, 1977. 

[17] S. Krantz, A Panorama of Harmonic Analysis, Mathematical Association 
of America, 1999. 

[18] S. Krantz and H. Parks, The Geometry of Domains in Spaee, Birkhauser, 
1999. 

[19] R. Macias and C. Segovia, Lipschitz functions on spaces of homogeneous 
type. Advances in Mathematics 33 (1979), 257-270. 

[20] S. Mac Lane and G. Birkhoff, Algebra, 3rd edition, Chelsea, 1988. 

[21] P. Mattila, Geometry of Sets and Measures in Euclidean Spaces, Cambridge 
University Press, 1995. 

[22] W. Rudin, Principles of Mathematical Analysis, 3rd edition, McGraw-Hill, 

1976. 

[23] W. Rudin, Real and Complex Analysis, 3rd edition, McGraw-Hill, 1987. 

[24] W. Rudin, Fourier Analysis on Groups, Wiley, 1990. 

[25] W. Rudin, Functional Analysis, 2nd edition, McGraw-Hill, 1991. 

[26] E. Stein, Singular Integrals and Differentiability Properties of Functions, 
Princeton University Press, 1970. 

[27] E. Stein, Harmonic Analysis: Real-Variable Methods, Orthogonality, and 
Oscillatory Integrals, with the assistance of T. Murphy, Princeton Univer¬ 
sity Press, 1993. 

[28] E. Stein and R. Shakarchi, Fourier Analysis: An Introduction, Princeton 
University Press, 2003. 

[29] E. Stein and R. Shakarchi, Complex Analysis, Princeton University Press, 
2003. 

[30] E. Stein and R. Shakarchi, Real Analysis: Measure Theory, Integration, 
and Hilbert Spaces, Princeton University Press, 2005. 

[31] E. Stein and R. Shakarchi, Functional Analysis: Introduction to Further 
Topics in Analysis, Princeton University Press, 2011. 

[32] E. Stein and G. Weiss, Introduction to Fourier Analysis on Euclidean 
Spaces, Princeton University Press, 1971. 


[33] M. Taibleson, Fourier Analysis on Local Fields, Princeton University Press, 
1975. 



